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INTRODUCTION 


It is a far cry in psychology to the 
establishment of the first psychological 
laboratory in America and to the founding of 
the American Journal of Psychology, both ac- 
complishments of G. Stanley Hall. Various 
schools of psychology have developed, which 
include the introspective and structural 
groups, behaviorism, Gestalt, psychoanaly- 
sis, and purposivism. The early beginnings 
in educational psychology, as represented by 
Stanley Hall's introduction of experimental 
psychology into America and his initiation 
of the child-study movement, have extended 
to a considerable number of educational 
problems and fields. In addition to publi- 
cations which discuss educational psychology 
and learning in somewhat general terms, there 
are specialized psychological treatments of 
such problems as mental hygiene, study, el- 
ementary-school subjects, secondary-school 
subjects, arithmetic, algebra, athletics, 
play, music, typewriting, skill, handwrit- 
ing, reading, spelling, character, personal- 
ity, social institutions, language, adoles- 
cence, childhood, adult learning, etc. 





The current issue of the Journal of Ex- 
perimental Education is devoted to studies 
of the psychology of learning and teaching. 
The articles in this number represent tech- 
niques of research and fields which merit 
brief introductory comment. 








The two studies by Hansen and by Brown 
@1l are evaluations of teaching method. Han- 
sen, using a parallel-group technique of ex- 





perimentation, has investigated a problem in 
the important field of visual education. 
Brownell has evaluated painstakingly and mi- 
nutely the effect of a device employed in 
the teaching of fractions in arithmetic. Not 
the least significant part of this report is 
the detailed account of the various steps 
and leads followed in the attempt tv solve 
the problem, 


Rogers through elaborate statistical 
calculations has analyzed "factors" related 
to school achievement. 


Eaton reports on experimental study in 
the field of motivation of learning. 


Two studies deal with pupil attitudes, 
a problem of increasing concern in the mod- 
ern school, Remmers et al. present 4 scale 
for measuring attitude toward high school. 
Whittaker reports a detailed investigation 
of the political and economic opinions of 
secondary-school pupils, 


ee 


Carter V. Good 
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THE EFFECT OF EDUCATIONAL MOTION PICTURES UPON THE 
RETENTION OF INFORMATIONAL LEARNING 
by 
John Elmore Hansen? 
University of Wisconsin 


PURPOSE OF STUDY 


The purpose of the investigation re- 
ported herein is to determine the contribu- 
tion of educational motion pictures to the 
retention of informational learning, 


PREVIOUS STUDIES 


During the past fifteen years numerous 
studies have been made to determine the con- 
tribution of educational motion pictures to 
learning, noteworthy among which are the 
studies of Weber*, Freeman®, Wood and Free- 
man¢, Knowlton and Tilton5, and the Middle- 
sex and Historical Association studies® in 
Great Britain. These studies have shown, 
without exception, that the educational mo- 
tion pictures, when properly used, are ef- 
fective aids to learning. The Knowlton and 
Tilton study in New Haven and the Historical 
Association study in England appear to be 
the only studies, however, in which much at- 
tention was given to the influence of motion 
pictures upon the retention of what was 
learned. Knowlton and Tilton measured the 
retention of historical knowledge over in- 
tervals varying from approximately three to 
seven months. The results of this study in- 
dicate that the experimental group had 





learned more, forgotten more, and remembered 
more than the control group.’ A summarized 
statement from The Film in National Life con 
cerning the British Historical Association 
study says: "The film leaves a remarkably 
permanent impression. Deferred tests were 
given in both inquiries up to seven months 
after the film had been shown, and produced 
surprisingly good results, particularly where 
the subject of the film had been related to 
the school curriculum and had been followed 
up by regular lessons."® With this rather 
limited information it would seem that fur- 
ther evidence is desirable to test the per- 
manency of learning made with the aid of mo- 
tion pictures. 





SUBJECTS 


The subjects employed in this investi- 
gation consisted of approximately two hun- 
dred fifty pupils, including the entire stu- 
dent body of the Middleton High School and 
the pupils of the seventh, eighth, and ninth 
grades of the Mt. Horeb Public Schools, The 
data presented herein are for eighty-two 
matched pairs of subjects. The chief facts 
concerning these subjects are given in Ta- 
ble I. The experimental and control groups 
employed in this investigation were eightytwo 





1. The writer wishes to express his appreciation of the kind cooperation and guidance extended by Dr. A. S. Barr, Pro- 


fessor of Education at the University of Wisconsin in the conduct of this experiment. 


Thanks are also due to Prin- 


cipal L. W. Amborn of the Middleton High School and to Principal Harley J. Powell of the Mt. Horeb Public Schools, 


in whose schools the study was made. 
2. Joseph J. Weber, Visual Aids in Education (Chicago: 
3. Frank N. Freeman, Visual Education (Chicago: 








The Educational Screen, Inc., 1980). 
The University of Chicago Press, 1924). 
4. Ben D. Wood and Frank N. Freeman, Motion Pictures in the Classroom (Boston: 


Houghton Mifflin Company, 1929). 





5. Daniel C. Knowlton and J, Warren Tilton, Motion Pictures in History Teaching (New Haven, Connecticut: Yale Univer- 





sity Press, 1929). 


6. Commission on Educational and Cultural Films, The Film in National Life (London: 
7. Daniel C. Knowlton and J, Warren Tilton, Métion Pictures in History Teaching (New Haven: 


George Allen and Unwin Ltd., 1952). 
Yale University Press, 





1929), p. 57. 


8. Commission on Educational and Cultural Films, The Film in National Life (London: 





p. 67. 


George Allen and Unwin Ltd., 1982), 











2 JOURNAL OF EXPERIMENTAL EDUCATION 





Volume II, No. 1 


TABLE I 
AGE, SEX, AND INTELLIGENCE OF THE SUBJECTS EMPLOYED 
IN THIS INVESTIGATION 


























| No. of No. of No. of M. A. I. Q. | Initial Test 
Group Pupils Boys Girls Average Average Average 
- — $—-——— —__ — —___4—_ -—_-____—_ ——_—— —— ————_— 
Experimental | 82 38 | 44 181.3 101.6 38.4 
Control | 82 36 | 46 180.8 101.9 38.0 
pairs of subjects matched according to THE TEST 


mental age, intelligence quotient, and pre- 
vious knowledge of the topic to be studied 
as measured by an initial test of informa- 
tion. The mean M, A. of the Experimental 
Group was 181.3 and of the Control Group, 
180.8; the mean I. Q. of the Experimental 
Group was 101.6 and of the Control Group, 
101.9. The Experimental Group was slightly 
higher in M. A. and slightly lower in I. Q. 
The number of girls in the Experimental 
Group was 44 and in the Control Group, 46; 
the number of boys was 38 and 36 respective- 
ly. Since psychologists? are almost unani- 
mous in agreeing that there is little differ- 
ence in the ability of the sexes to learn, 
it would seem that little would be gained in 
matching according to sex. 


MATERIALS OF INSTRUCTION 


. The topic studied by the two groups was 
' that of the making of pottery. This topic 
was one with which none of the pupils were 
likely to be familiar and one about which 
they had little previous information. The 
motion picture film used was the Eastman 
Teaching Films, Inc., production, "Table- 
ware." Following the opening scene which 
depicts primitive pottery making, it shows 
in detail the various steps involved in the 
manufacture of earthenware in a modern pot- 
tery factory. It is a 16m, film, one reel 
in length, and required fourteen minutes for 
presentation. The reading matter consisted 
of a two-thousand word written exposition 
which described in detail all of the steps 
in the various processes pictured in the 
film. Every possible effort was made to 
make the written exposition and the film as 
nearly comparable as it is possible to make 
two such different kinds of presentation, 





The test employed in the investigation 
consisted of three parts, totalling 102 
items: Part I was made up of twenty-seven 
statements, each of which was to be complet- 
ed by indicating one of five giventerms as 
the correct one; Part II consisted of eleven 
questions, each of which had five statements 
concerning the same operation or object, any 
or all of which might be correct or wrong in 
which each pupil was asked to mark each of 
the statements indicating whether it was 
true or false; Part III consisted of twenty 
true-false statements which were to be mark- 
ed either true or false. 

The test was given three times. It was 
given first ten days previous to the presen- 
tation of the film and the reading material, 
This test serv3d two purposes: first, as an 
aid in equating the groups, and, second, as 
a means of determining the initial knowledge 
of the topic already possessed by the pupils. 
The lapse of a ten-day period before admin- 
istering the test a second time also served 
to minimize any practice effect. The second 
application of the test was made immediately 
after the pupils had received their instruc- 
tion in the topic to be studied before they 
had any opportunity to forget what they had 
learned, The third test was given three 
weeks and four days, in the case of the Mid- 
dleton School, and three weeks and two days, 
in the case of the Mt. Horeb Schools, after 
the second test to measure the effects of 
the two modes of presentation upon the re- 
tention of what had been learned as shown by 
the second test. 

The reliability of the test was deter- 
mined by the split-test method for the scores 
on the second and the third applications of 
the test. The coefficient for the second 





1. Rudolph Pintner, Intelligence Testing (New York: 





Henry Holt and Company, 1951), pp. 496-501. 
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application of the test (T.) is .765, and | 
for the third application of the test (T;), 
.815. The intercorrelation coefficient be- 
tween the results on the first application 
(T,) and the second application (T,) is .359, 
and between the second (T,) and the third 
(T;), .939. Since the results gained on T) 
are an indication of the information already 
possessed by the subjects, which had been 
gained through random reading and observa- 
tion, and since the additional information 
measured in Tp, and Ts had been obtained un- 
der the same closely controlled conditions, 
it is to be expected that a lower correla- 
tion would exist between T, and T, than be- 
tween Tp and T;. These reliability coeffi- 
cients taken as a whole are sufficiently 
high for the purposes of this study. 

No attempt has been made to determine 
the validity of the test. The items of the 
test were, however, carefully selected to 
cover the information given in the written 
exposition and the film. In making this 
statement the impossibility of constructing 
a verbal test which will adequately measure 
all of the contributions made through the 
means of a pictorial presentation is fully 
realized. It would appear that verbal tests 
favor learning gained through verbal presen- 
tation. 























PRESENTATION OF SUBJECT MATTER 






The presentation of the subject matter 
to the pupils was as follows: 







Experimental Group 

5 minutes - Orientation period 

14 minutes - Viewing of film 

10 minutes - Oral discussion based upon 
study outline in hands of 
pupils 
Studying written exposition 














14 minutes 


43 minutes 






Total length of period 





Control Group 
5 minutes 
14 minutes 





Orientation period 
Reading of written exposi- 
tion 

Oral discussion based upon 
study outline in hands of 
pupils 







10 minutes 









John Elmore Hansen 





14 minutes - Studying written exposition 
the second time, using 
study outline as a guide 


43 minutes - Total length of period 


It will be noted that the only differ- 
ence in the treatment of the two groups was 
the substitution of a fourteen-minute film 

| presentation in the experimental set-up for 
| fourteen minutes of reading in the control 
| set-up. 

The total control and experimental 
groups were each taught in three separate 

| sections as follows: 


‘Middleton High School 

1. 9th - 12th Grades, inclusive 
Experimental Class 42 pupils 
Control Class 42 pupils 


| Mt. Horeb Public Schools 

2. 9th Grade 
Experimental Class 23 pupils 
Control Class 23 pupils 


3. 7th and 8th Grades 
Experimental Class 17 pupils 
Control Class 17 pupils 


The subject matter was presented to all 
of the group sections by the same instructor 
/and the tests were administered by the same 
individual. This insured, as nearly as pos- 
sible, that the pupils in the various séc- 
tions received the same treatment, 


’ 


RESULTS 


The mean scores of the eighty-two pu- 
pils in the Control Classes and the eighty- 
two pupils in the Experimental Classes on 
the three successive tests are presented in 
Table II. The mean scores for the Control 
Group were: for T,, 38.0; for Tp, 66.1; 
| and for T;, 60.9. The mean scores of the 

Experimental Group were: for T,, 38.4; for 
To, 70.0; and for T;, 65.2. This represent- 
/ed a gain between T, and T, of 28.1 points, 
and a net gain between T, and Ts, of 22.8 

for the Control Group; and corresponding 
gains of 31.6 and 26.8 respectively for the 
Experimental Group. A comparison of these 
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TABLE II 
MEAN SCORES AND STANDARD DEVIATIONS FOR THE 
EXPERIMENTAL AND CONTROL GROUPS ON TESTS I, II, 
AND III; THE GAINS BETWEEN TESTS; AND THE 
DIFFERENCES IN GAINS BETWEEN GROUPS 





X GROUP C GROUP 















8.D. 














I || 38.4 8.D.(ais.) 11.86 38.0 S.D-(dis.) 12.37 4 
8.D-(ay.) 1.31 8.D-(ay,) 1.36 

II | 70.0 S.D-(ais.) 10.34 66.1 8.D-(dis.) 11.85 3.9 
8.D-(av.) 1.14 8.D.(ay.) 1.31 

III | 65.2 8.D-(ais.) 11.06 60.9 8.D-(ais.) 10.17 4.3 
S.D+(ay.) 1.22 8.Ds(av.) 1.12 

II-I | 31.6 8.D+(dis.) 12.98 28.1 8.D-(ais.) 13.80 3.5 


S.D-(av.) 1.43 S.D-(av.) 1.52 


















III-I | 26.8 §.D-(dis.) 13.26 22.8 5.D.(dis.) 12.69 4.0 
8.D-(ay.) 1.46 8.D-(ay,) 1.40 

































results shows that the Experimental Group Two conclusions may be drawn from a 
made an average gain of 3.5 points more be- | study of the above results: (1) The educa- 
tween T, and Tp than did the Control Group, | tional motion picture is an aid to learning, 
and that when tested after an elapse of especially in the acquisition of information 
nearly a month the net average gain between | such as portrayed in the film used in this 
T, and T, is 4.0 points more for the Experi- | study. This bears out the results of other 
i mental Group than for the Control Group. studies mentioned earlier in this paper. 

; This difference between the Experimental | (2) It seems that the knowledge gained 

and Control Group gains from T, to Tp is .31 | through the means of pictorial presentation 
of the standard deviation of the combined is fully as permanent in its nature as that 
Experimental and Control Tp scoresl; the dif- | gained through reading and oral presentation. 
ference between the Experimental and Con- Apparently the impressions made by combina- 
trol gains from T, to Tz; is .37 of the tions of pictures and words are more lasting 
standard deviation.® In terms of per cent than those produced by words alone. It .is 
the differences in gains in favor of the Ex- | possible that the association of verbal forms 
perimental Group are 12.1% and 17.5% respec- | with the appropriate visual images enabled 
tively. The Standard Error® of the first the pupils in the Experimental Group to gain 
difference is 1.615 and of the second, 1.62, | clearer and more definite notions of objects 
giving critical ratios of 2.17 and 2.46 re- (| and their movements; thus the materials be- 
. spectively. These differences are believed | coming more meaningful are learned more 

- to be significant. readily and retained longer than less mean- 
ingful material. 






































1. 8-Den (x + ¢), 10.89 
2. S.Den(x +c), 11.80 
5. Standard Error of the difference in gains between groups of matched pairs has been computed as equal to 


/{s.boy,)* + (S.Diy,)° - era, 





Mx, mean Experimental gain 
M,, mean Control gain 
r, coefficient of correlation between individual X gains and C gains. 





AN EVALUATION OF AN ARITHMETIC “CRUTCH" 


by 


William A. Brownell 
Duke University 


PURPOSE OF THE STUDY 


The major purpose of this article! is 
to report an investigation concerning the ef- 
fects of instruction in the use of an arith- 
metic "crutch" in adding proper fractions. 
Data will be presented and analyzed with re- 
spect to three phases of the problem: first, 
the degree to which the "crutch" tended to 
persist in the arithmetic behavior of the 
children who had been taught to employ it; 
second, the extent to which use of the 
"crutch" involved children in uneconomical 
andeven harmful procedures in adding proper 
fractions; and third, the relative amounts of 
ability to add fractions exhibited by chil- 
dren who had been taught the "crutch" and by 
comparable children who had not been taught 
this device. 


The report is divided into two parts 
which are dissimilar in intent and in con- 
tent and which are designed to accomplish 


two unlike purposes. The first part is an 
account of the steps in the investigation. 

It is little concerned with the exact data 
collected, but it is mich concerned with the 
way in which the problem was almost acciden- 
tally discovered and with the manner in which 
through the application of appropriate re- 
search techniques the problem, none too clear 
at times, was finally made to yield to a so- 
lution. This part of the report is frankly 
intended to serve two ends. The first is to 
furnish a realistic example of the way in 
which much research is done: with periods of 
alternate confusion and enlightment, with the 
commission of many errors of judgment and the 
correction of those errors, and with the fol- 
lowing of faulty leads and the attainment of 
fortunate successes. (Such a description of 





research method, it will be noted, bears 
little resemblance to the descriptions to 

be found in manuals and in many theoretical 
discussions of research.) The second end” 
served by this part of the report is to sup- 
ply evidence that not all research is spir- 
itless grubbing and boring drudgery, but 
that on occasion it affords all the excite- 
ment and all the thrills enjoyed by a detec- 
tive at work on his favorite type of crime. 

Part II, by contrast, has little about 
it to make it emotionally attractive. Here 
will be presented a large share of the data 
which were collected in the course of the in- 
vestigation, together with the evaluations 
and comparisons by means of which these data 
were studied to discover the relation of the 
use of the "crutch" to the development of 
ability to add proper fractions. While the 
statistical treatment which will be described 
is relatively simple, it is exceedingly de- 
tailed, and the reader who finds little to 
interest him in detail may be well advised 
not to read Part II at all. This part of 
the report is intended to supply the factual 
basis for Part I and to illustrate to the 
reader who is curious about such procedures 
certain useful methods of analysis which are 
not commonly employed. 

Before beginning with Part I or skip- 
ping to Part II the reader will need certain 
information regarding the setting of the in- 
vestigation, the techniques employed and the 
measures collected. This information is sup- 
plied in the following two sections. 


SOURCE OF THE DATA 


Data for this report were collected 
from 248 fifth-grade children’ who constituted 





1. The writer is deeply obligated to Dr. B. R. Buckingham, of Harvard University, and to Dr. Charles H. Judd, of the 
University of Chicago, both of whom read the original manuscript and offered much helpful criticisn. 
2. Data are actually reported for 245 children only, since the records of three pupils were not susceptible to accurate 


interpretation. 
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the total enrollment in seven schools, three 
of them in Nashville, Tennessee, and four of 
them in adjacent Davidson County, Tennessee. 

The test used was a form of the Brueck- 
ner Diagnostic Test in the Addition of Frac- 
tions? which was abridged and modified for 
the special purposes of this study. As re- 
vised, the test below consists of fifteen 
examples which classify into three major 
types. In the first five examples the com- 
mon denominator occurs in both the fractions 
to be added; in examples 6-9 the common de- 
nominator occurs in only one of the frac- 
tions; and in examples 10-15 the common de- 
nominator must be found. 


TEST USED IN THE INVESTIGATION 




















1 1 2 
(1) . (2) B (3) 7 
2 5 5 
_5_ m4 ~ & 
4 4 i 
(4) ry (5) ry (6) . 
2 * a 
» 6 = 
1 2 A 
(7) $ (8) + (9) ; 
2 3 S 
= 3 ss Pe 
3 1 2 
(10) s (11) + (12) = 
1 2 7 
12 rr Ti 
3 3 ” 
(13) 7 (14) 7 Y (15) > 
3 4 5 
7 5S 6 
THE "CRUTCH" 


The "crutch" which is the object of in- 
vestigation in this study may be illustrated 
as it was employed in performing the addi- 
tion required by the first example: 
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(1) (3) (5) (6) (7) 
i e 1 x s. « i 
5 5 x 1 5 
zg a z x ’ oe z& 
5 5 x 5 
<2 
5 


Its cumbersomeness and artificiality are at 
once apparent: the re-writing of the un- 
changed fractions (3), the double appearance 
of the multiplication sign after each frac- 
tion (4), the multiplication by # (5), and 
so on. It is true that the miltipliers in 
(S) varied somewhat from paper to paper and 
that there were other minor differences, but 
the general scheme with its unusual and per- 
haps unwarranted degree of elaborateness re- 
mained surprisingly constant. 


Part I 
EVOLUTION OF A RESEARCH PROJECT 


Discovery of the "crutch." The use of 
the "crutch" was not discovered until some 
weeks after the fraction test was adminis- 
tered in the seven schools during the last 
week of January and the first week of Feb- 
ruary, 1930. As a matter of fact, the in«- 
vestigation proper had been originally de- 
signed to isolate and classify children's 
errors and difficulties in adding proper 
fractions as these were directly observable 
or indirectly procurable through individual 
interviews. The test papers, on which the 
children recorded the work which they de- 
scribed orally, were collected only on the 
chance that they might afford additional in- 
formation of value with respect to a com 
plete picture of children's methods in add- 
ing fractions. 








of the seven schools involved in the study. 
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In the course of the tabulation of data 
from the test papers the "crutch" was noted 
in the work of the children in two of the 
schools (hereafter designated as schools 1 and 
2). Its consistent appearance on the papers 
from these schools and its complete absence 
on the papers from the other schools (schools 
3 to 7) was indeed striking and at once 
raised a number of questions. 

Reason for interest. Further study of 
the test papers in schools 1 and 2 revealed 
the fact that the "crutch" had been employed 
with all three types of example in the test. 
As it was used with the second type of ex- 
ample it regularly had the appearance (ex- 
ample 6): 








i 2 i 
2 2 


>| 0 


|e 


i 
4 


Cad Ls 
mio | ale 


Its form in connection with the third type 
of example was as follows (example 10): 


8 8 x 3 24 
og! aim > we: 2 «© 
12 12 x 2 24 

u 
24 


Now the possible usefulness of such an 
elaborate device in solving the last six 
problems is understandable, but its value in 
solving the easier examples of the first and 
second types is by no means obvious. It was 
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their later work in adding fractions. 
Two facts of considerable interest stood 
out in clear relief. First, here was a case 
of deliberate instruction in the use of a 
"crutch", a practice which on theoretical 
grounds is freely condemned by many writers 
on Educational Psychology and on the teach- 
ing of arithmetic.! Second, the situation in 
which the “crutch" was being employed was 
such as to permit at least a partial evalua- 
tion of its effectiveness and its harmfulness 
through the agency of quantitative research 
techniques. 

The literature on the place of "crutches" 
in instruction is highly controversial. It 
is likewise highly opinionated, for experi- 
mental data on the value and dangers of 
"crutches" are exceedingly rare. All too 
frequently the worth of a given instruction- 
al device is determined by some type of a 
priori classification. If the device seems 
to violate certain psychological principles 
which are dear to him, the opponent of 
“crutches” is apt to call it a "crutch"; if 
it appears to be useful, he will call it 
something else. On the other hand, the de- 
vice is a "crutch" to the exponent of “crutch 
es" if it appears to facilitate learning, and 
it is something else if it interferes with 
learning. In both cases, the decision is 
based upon preconceived notions, and in 
neither case are the merits of the device 
satisfactorily evaluated by labelling it or 
by refusing to label it a “crutch.” It ought 
to be possible to test the usefulness of in- 
structional devices by studying the degree to 
which they advance or retard sound learning. 
Fortunately in the conditions which surround- 
ed the use of the fraction form found in 
schools 1 and 2 an opportunity was afforded 
for just such an analysis of its influence 


apparent that the pupils in these two schools; upon learning. In large measure, then, this 


had been taught the complicated written de- 
vice at the time they were first introduced 
to the addition of fractions and that they 


report may well be taken as an example of an 
attempt to substitute quantitative data for 
opinion, an effort to determine the value of 





had been trained to employ it through all 





an instructional device, not by attaching a 





1. Some may be unwilling to accept the form described above as an example of a true "crutch." Unlike most "crutches" 
this form does represent an advanced instead of an immature procedure, a finished instead of a crude way of doing 





something. It had its origin, not in the naive thinking of pupils approaching the study of a new arithmetical proc- 
ess, but in the mathematical logic of that process and in the thinking of the teacher who had mastered that logic. 

Nevertheless, the form is here regarded as a "crutch." The justification for so classifying it lies in the fact 
that the device was given the children at the beginning of their study of fractions as a temporary aid, as a means of 
explaining to them the nature of the process, with the full expectation that eventually this "crutch" would be aban- 
doned for other more direct procedures. If to the reader this reason seems insufficient to justify calling the form 
a "crutch", he may substitute in his thinking some such term as "device" for the recurring "crutch" without in any 


serious way altering the validity of the comparisons which will be made. 
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name to it, but by measuring the results of 
its use. While the form will be referred to 
as a “crutch” this practice is not intended 
in any way, favorably or otherwise, to pre- 
judge the case; it will be called a "crutch" 
because of its artificiality and its pur- 
pose, and it may still be called a "crutch" 
after the facts have been surveyed with re- 
gard to the reality of its contribution to 
learning. 


(a) Persistence of the "crutch" 


Results of the first test in school l. 
The fraction test was given the 35 children 
in grade V of school 1. Their papers and 
the records of their oral reports showed 
that these children fell into three distinct 
Classes with regard to their use of the 
"crutch." Nine of the 35 children did not 
use the "crutch" at all. Thirteen others 
used the “crutch” ten times, with the four 
examples of the second type (C.D. given in 
one fraction) and with the six examples of 
the third type (C.D. to be found). The re- 
maining thirteen used the "crutch" in all 
fifteen examples quite irrespective, there- 
fore, of the three types of examples in the 
test. 

These facts suggested what might be 
called a process of selective elimination, 
that is, a series of stages in growth in 
which the elaborate form was progressively 
abandoned as the cumbersomeness of that form 
became increasingly apparent. All the chil- 
dren had received the same instruction from 
teacher and textbook and had been equally 
trained, it is to be supposed, in the use of 
the "crutch" as a means of adding fractions. 
But now, some weeks later, some of the chil- 
dren apparently had completely freed them- 
selves from it: they appeared to have grown 
beyond the point where it contributed any- 
thing vital to their work, even with compara- 
tively difficult examples. Certain others 
still needed it or preferred to use it, even 
with the simplest examples. Still others oc- 
cupied an intermediate position between the 
two extreme groups. These last children had 
reached a stage in development where they 
seemed to differentiate between types of ex- 
amples, with some of which they employed the 
form and with others of which they no longer 
needed it. 
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It was this variable elimination of the 
"crutch" which first attracted attention. 
There must have been some reason why these 
children differed so markedly in the degree 
to which they had sloughed off use of the 
complicated form in favor of more direct 
procedures. The hypothesis of selective 
elimination even if valid required some ex- 
planation itself: Why had some of the chil- 
dren entirely discarded the "crutch" while 
others were completely bound to it? If the 
extent to which the "crutch" was still oem 
ployed, some weeks after instruction in its 
use, did actually correspond to stages in 
development, how long would it be before 
those who were most dependent upon it would 
give it up? In a word, what factors were 
responsible for the persistence of the 
"crutch"? 

An answer to these questions was first 
sought in the qualities and characteristics 
of the children themselves. As complete 
measures as possible were secured on various 
aspects of general development and of arith- 
metical ability. As measures of general de- 
velopment the following were taken: C.A., 
from the school records, and I.Q. and M.A. 
from the National Intelligence Test which 
was given the children for this specific 
purpose. As measures of arithmetic ability 
two scores on the fraction test were already 
available, the number of examples correctly 
computed and correctly reduced and the num 
ber of examples correctly computed but not 
reduced. Measures of problem-saving ability 
were obtained by taking scores on the prob- 
lem-solving test in the N.I.T. General 
arithmetical ability, it was thought, was 
best indicated by teachers! marks. The 
school records in the case of most of the 
children yielded arithmetic marks for the 
two preceding terms. 

It was proposed to compare the distri- 
butions of the three groups of children 
(those who used the "crutch" 15, 10, and 0 
times on the test) for these different meas- 
ures on the chance that significant relation- 
ships might be revealed. Thus, if for ex- 
ample, the children in the 15-group were 
found to average 130 months in M.A., and 
those in the 10-group to average 140, and 
those in the O-group to average 150, one 
might suspect that the tendency to abandon 
the “crutch" was related to the attainment 
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of a certain level of mental maturity. 

Complete measures were to be had for 
only 31 of the original 35 children in the 
grade.1 These separate measures, being in 
themselves essentially meaningless, were 
next subjected to various methods of statis- 
tical analysis (the nature of which, not 
here reported, will be found in Part II). 

For the purposes of Part I it will serve 
merely to report the results of these analy- 
ses. In general, they were negative: no 
evidence could be found of a connection, mch 
less of a close causal connection, between 
the persistence of the "crutch" on the one 
nand and any of the measures here studied on 
the other hand. These findings were more 
than a little disconcerting. The methods of 
analysis had been adopted with full confi- 
dence in their efficacy and with the expecta 
tion that they would isolate the cause for 
the persistence and the elimination of the 
"crutch", in so far as causes may be inferred 
at all from close statistical agreement. 

And yet, while the data seemed to prove 
conclusively that persistence of the "crutch" 
could not be accounted for in terms of any 
of the available measures, the hope that the 
theory of selective elimination might be es- 
tablished would not down. An examination of 
the means of the various developmental and 
arithmatical measures for the groups gave 
some basis for this hope. On the average the 
10-group did better on the fraction test than 
did the 15-group and the O-group. When scores 
were based on completely correct answers all 
except one of the twelve children in the 10- 
group equalled or surpassed the mean score of 
the O-group (by a margin of three examples or 
more), and eight of the twelve surpassed the 
mean of the 15-group. When credit was al- 
lowed for unreduced answers, the 10-group 
were still considerably superior to the 0- 
group, but not quite as successful as were 
the 15-group. As determined from teachers! 
marks, the erithmetic ability of the 10-group 
was markedly better than that of the other 
two groups, all except three of the twelve 
10-children averaging higher than the mean 
for the O- and the 15-groups. They likewise 
—— some advantage over the other groups 
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in average M.A. and in average I.Q. 
these facts taken together, it was felt, 
might mean that the 10-children, who showed 


All of 


some discrimination in their use of the 
"crutch", were actually at the most desira- 
ble stage of their development.*® Those who 
had discarded the form altogether (the 0- 
group) may have done so prematurely; they 
may have progressed too rapidly. Likewise 
those who still slavishly used the "crutch", 
employing it mechanically even with the sim 
plest examples, might represent cases of 
"arrested development." 

But all this was based on hope and 
might easily constitute a case of wishful 
thinking. The facts regarding the elimina- 
tion of the "crutch" were still to be es- 
tablished, to say nothing of the theory of 
selective elimination. Furthermore, there 
remained the data from school 2 which might 
confirm, refute, or merely modify the inter=- 
pretations placed on the data for school l. 
‘The next step then was to examine the test 
data and the various other measures for 
school 2. 

Results of the first test in school 2. 
Unlike school 1 which has the semester sys- 
tem school 2 is organized on the quarter ba- 
sis, with three terms of approximately three 
months each. Each grade thus comprises three 
groups of children. ‘The lowest of these 
groups, that is, the one beginning the work 
of the grade, is known as the C-section, the 
two higher groups being designated in order 
by the letters B and A. The subject of frate 
tions is introduced in the C-section of 
grade V. Accordingly, when the fraction 
test was given in this school for the first 
time in the first week of February, 1930, to 
the C- and Besections, the children in the 
C-sections had completed their study of frac- 
tions about a month before, and the children 
in the B-sections, about four months earlier. 
At the time of the testing and interviewing 
no attention was paid as to whether the 
children came from the C-section or from 
the Besection except to make note of the 
fact on the blanks when the tests were be- 
gun. Forty-five children were tested.® 














-« See Table I, Part II. The loss in cases was as follows: 
the 10-group, and two from the thirteen in the 15-group. 


one from the nine in the 0-group, one from the thirteen in 


2. Data collected in another connection also revealed the fact that the 10-group were much less prone than were the 15- 
securing 


and the O-groups to utilize faulty and indirect procedures in 


8. See Table IV, Part II. 
as compared with Table I for school 1. 


their answers. 


Scores on the problem-solving test in the N.I.T. are not given. This is the only difference 
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When the data had been tabulated it was 
discovered that nineteen of the children had 
made use of the "crutch" in all fifteen of 
the test examples and that the remaining 
twenty-six had not used it once. There was, 
then, in this school no intermediate group 
such as was found in school 1, who employed 
the form with certain examples but not with 
others. Furthermore, all nineteen children 
who consistently used the form belonged to 
the C-section while twenty-three of the 
twenty-six who did not use it at all be- 
longed to the B-section. To the nineteen C- 
section children who had just concluded their 
study of the addition of fractions the 
"crutch" was either a necessary procedure 
or was one which was used habitually if 
needlessly. On the other hand, the twenty- 
three children in section B, being now four 
months removed from their instruction in the 
addition of fractions, no longer needed or 
employed the "crutch." 

But thus far, three cases have been 
omitted from the discussion. ‘These three 
(Cases 131, 142, and 147) were enrolled in 
the C-section, yet behaved like pupils in 
the B-section in that they did not once use 
the form in taking the test. Here seemed to 
be some important exceptions to the remark- 


ably consistent data furnished by the other 


42 children in the grade. The uniqueness of 
their performance led to a study of their 
school records. There it was learned that 
two of them (Pupils 131 and 147) had failed 
of promotion to the Besection at the end of 
the preceding quarter, were repeating the 
work of the C-section, and had therefore just 
been taught for the second time how to add 
fractions. Instead, therefore, of constitut- 
ing exceptions these two cases proved to be 
excellent examples of the theory which had 
been advanced: to the effect that children, 
once they had gotten what they could fromthe 
form, tended to discard it.4 The four months 
elapsing since their introduction to frac- 
tions had served the same purpose for them 
as it had for their fellow-pupils who had 
been promoted to the Besection. 

The data for the two sections of school 
2 could thus be made to agree with respect 
to the tendency for children to retain or to 
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abandon use of the "crutch." Those pupils 
in the Besection who had had time to grow 
away from the form had completely done so; 
the C-section pupils, by hypothesis, were 
still too close to their instruction in its 
use to be able to dispense with it, though 
they might be expected to do so after a rea- 
sonable length of time. But the abandonment 
of tne form in the B-section could, of course, 
have been due to other causes than the one 
of lapse in time for instruction,--causes 
associated, for example, with certain quali- 
ties or characteristics of their general de- 
velopment or of their general arithmetic 
ability. Comparison of the means for the 
two sections on those traits for which 
measures were obtained showed the B-section 
to have made distinctly higher scores on the 
fraction test, a fact which was to have been 
expected from their relative school classi- 
fication; but more importantly these compari- 
sons showed the B-pupils to have had marked 
advantage in brightness (1.Q.) and in mental 
maturity (M.A.). Had it not been for the 
check supplied by the data in school 1, where 
no relation was found between intellectual 
factors and abandonment of the "crutch", the 
conclusion might well have been advanced 
from the data for school 2 that the extent 
to which the form was used was directly con- 
ditioned by the intelligence of the pupil. 
Results of re-testing in school 1. While 
such a conclusion was untenable in the light 
of the different conditions in the two 
schools, still certain other inconsistencies 
made it equally dangerous to place all the 
emphasis on distance in time from instruc- 
tion in the use of the "crutch" as thechief 
factor to account for its use or its elimi- 
nation. This explanation fitted well enough 
the facts in school 2, but it did not at all 
fit the facts in school 1. In school 2 the 
time between initial instruction in adding 
fractions (and hence in the use of the 
"crutch") and the administration of the 
fraction tests differed for the B- and C- 
sections, and this time difference was al- 
most exactly reflected in the relative de- 
gree to which the children in the two sec- 
tions employed that form. But in school l 
the time between instruction andtest was 








1. It was, however, impossible to discover any explanation for the failure of Pupil 142 to use the "crutch" in the test. 
Accordingly in the comparisons later made Pupil 142 was reteined in the C-section (Table IV) whereas Pupils 181 and 
147, for the reasons given above, were omitted from the C-section. 
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uniform for all pupils, and yet there re- 
mained marked differences in the degree to 
which they were dependent on that device, 

By the time that the data had been 
analyzed far enough to disclose the lack of 
agreement in the results for the two schools 
about three months had passed since the 
first testing and interviewing in school l. 
In other words the fifth-grade children in 
this school were about four and a half months 
distant in time from their first instruction 
in the addition of fractions. So far as 
time relations between instruction and test- 
ing were concerned, then, the situation at 
that time (May) in school 1 was roughly sim- 
ilar to that in section B of school 2 when 
the test was given there originally. It was 
decided to give the fraction test a second 
time to the children in school 1. The ex- 
pectation was that these children would show 
the same effect of time upon their use of 
the "crutch" as was manifested in the case 
of school 2. Other things being equal, all 
the children in school 1, it was thought, 
should on the re-test give evidence of hav- 
ing completely discarded the form. 

But apparently “other things” were not 
"equal"; the re-testing failed utterly to 
produce the anticipated results. Six of 
the eight children in the O-group on Test 1 
used the "crutch" ten times on Test 2. All 
twelve of the 10-group on Test 1 used the 
"crutch" ten times or more on Test 2. None 
of the eleven children in the original 15- 
group had entirely freed themselves of the 
"crutch" in the four months! interval, only 
four of them using the form at all less com 
monly on Test 2. 

Instead of clearing up the situation, 
therefore, the re-test data in school 1 fur- 
ther complicated it. It had seemed reason- 
able to believe that the extensive use of 
the "crutch" on Test 1 by the children in 
school 1 was due to their closeness to in- 
struction in its use and to believe further 
that lapse in time would bring about a charge. 
Lapse in time did bring about a change, but 
a@ change in the wrong direction, if the re- 
test data were to be accepted as valid. The 
interval of four and a half months from ini- 
tial instruction in the use of the "crutch" 
had not only not led to its abandonment; it 
had fastened the "crutch" even more firmly 
upon its users. The disagreement in results 
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| for schools 1 and 2 had not been removed as 


anticipated; it had been increased. 

Results of re-tests in school 2. It 
was impossible to drop the investigation at 
this point. The diverse sets of conditions 
in schools 1 and 2 simply could not both be 
equally true. It seemed scarcely reasonable 
that in school 2 the passing of a certain 
amount of time should witness a general 
growth toward independence from the “crutch” 
while in school 1 the passing of a longer 
period of time should witness no such change, 





| Or rather on the contrary should reveal a 


reversion of its use on the part of children 
who earlier appeared to have abandoned it. 
The need for further study was clearly indi- 
cated. 

One explanation for the discrepancy in 
the two schools lay in the possibility that 
the data for school 2 with respect to use 
of the "crutch" might have been improperly 
interpreted. ‘The C-children had been found 
to be completely dependent upon the "crutch", 
and the B-children to be completely inde- 
pendent of it. The assumption was made that 
the children who were in the B-section at 
the time of the test had, three months be- 
fore, been fully as dependent on the "crutch" 
as were those who were then actually in the 
C-section. The assumption was further made 
that the C-section children three months 
later would be as free of the "crutch" as 
were the children actually in the Besection 
at the time of the test. 

Assumptions of this kind are not infre- 
quently made in studies of the type here re- 
ported, but there is always the danger that 
in a particular instance the assumption may 
be invalid. Fortunately it was not necessa- 
ry in this study to leave in doubt the valid- 
ity of the assumption. The 22 children in 
the C-section at the time of Test 1 were 
still available for re-testing three months 
later. All of them had for about one month 
been enrolled in Section B under a new teach- 
er. 

On the re-test not one of these 22 
children, who on Test 1 had used the "crutch" 
consistently, made any use of that form. 

The general direction of the change 
wrought during the lapse of time since Test 
1 was not unexpected in view of the results 
from the first test in the two sections of 








this school. But the uniformity of the 
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change, the fact that it had taken place for 
all of the children, was unexpected. 
And yet here again the cause of the 
elimination of the "crutch" need not have 
been merely time, with the opportunity it 
afforded for the discovery and adoption of 
simpler procedures in adding fractions. The 
cause might just as conceivably have been 
the influence of the new teacher. Accord- 
ingly she was informed of the nature of the 
investigation and was questioned concerning 
her observations of the children's use of 
the form. She had noticed, she said, that a 
few children (not nearly all of them, as was 
true at the time of Test 1) on first coming 
into Section B tended to employ the device. 
By far the majority of them, however, appar- 
ently had dispensed with it. The few who 
still relied upon it were easily led to give 
it up for new methods. Moreover, once 
learned, the new methods rapidly and thorough- 
ly superseded the "crutch." Again she em- 
phasized the fect that she encountered no in- 
structional difficulty: such children as 
still employed the form seemed to be doing 
so largely in a mechanical fashion and took 
readily to suggested improvement, 

Next an interview was held with the 
teacher who had given all the children in 
grade V, both the B- and the C-sections, 
their first training in adding fractions. 
TYhe first explanations, she said, as well as 
all subsequent practice in adding fractions 
were in terms of the form, The children were 
required to show the procedure in all their 
written work, and solutions were regarded as 
wrong if the form did not appear on the pa- 
per or on blackboard. No exceptions were al- 
lowed. When she was told of its disappear- 
ance from the fraction habits of her pupils 
later in section B, she expressed no sur- 
prise. On the contrary, she stated that very 
frequently during their last weeks in the C- 
section numbers of children omitted the form 
in their work and that itwas difficult to 
maintain regard for the rule for its use. 
They seemed to discover on their own initia- 
tive that this form no longer contributed to 
their success in adding fractions, and in 
spite of her active opposition they seemed 
to prefer their new self-evolved procedures. 

Further testing in school 1. All of the 
data for school 2 were now internally consis- 
tent. But so were most of the data for 
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school 1 internally consistent, and the sto- 
ries told regarding the persistence and 
elimination of the "crutch" were diametric- 


ally opposed to each other. In school 2 the 
"crutch" was generally discarded when the 
children no longer needed it; in school l 
the "crutch" not only was not discarded, but 
it seemed to become more rather than less 
necessary as time went on. In fact the re- 
test data in the latter school clearly justi- 
fied the question as, to whether the children 
in this school might ever later be found 
adding fractions without the device. 

As one means of finding an answer to 


| this question the fraction test was given to 


32 children in grade VI of this school, all 
of whom had attended grade V in that school 


| the year before and had been taught the 


"crutch" even as had the pupils who at the 
time of the investigation were in this grade. 
These sixth-grade children were, then, a 
year and several months removed from their 
instruction in the use of the "crutch" and 


| were roughly comparable to the fifth-grade 


children in the same sense that the children 
in the B- and the C-sections of school 2 
were comparable, except, of course, in 
school 1 a pericd of more than a year in- 
stead of three months separated the two 


| groups. 





Not one of the 32 children in grade VI 
used the "crutch" in the fraction test. 

The observations of the sixth-grade 
teacher relative to the children's use of 
the form agreed exactly with those of the 
teacher of the B-section in school 2. At 
the beginning of the year, that is, a few 
weeks after leaving grade V in which the - 
form was required, a small number of the 
children, she noticed, employed the forn. 
The few were induced with a minimum of ef- 
fort on her part and on theirs to substitute 
other more economical methods. Most of the 
children, she found, had already adopted 
such methods, apparently of their own ac- 
cord, for they used them before she sup- 
plied them, and she knew that in grade V 
these methods were neither suggested nor 
permitted. 

The conversation with the sixth-grade 
teacher together with the results of the 
testing in her grade tended to reduce some- 
what the differences between schools 1 and 
2. These new data from grade VI showed 
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that the general course of development away 
from the “crutch” might actually prevail in 
school 1 as it seemed to in school 2, in 
spite of the evidence to the contrary in the 
re-test data for grade V. 

Subsequent correspondence with the 
teacher of grade V in school.’ The one set 
of data, those for the re-tests in grade V 
of school 1, were the only ones which were 
now out of line with a consistent account of 
the experience of these children first in 
using the later in laying aside the "crutch." 
Realization of this fact two months after 
the re-testing, when the tabulation and in- 
terpretation of data had reached this point, 
led to a critical consideration of the con- 
ditions under which the re-tests had been 
given in this grade. These had been adminis 
tered in May, the day before the term exam- 
inations in this school. In the midst of 
the complications which naturally arose from 
the unexpected introduction of these re- 
tests into an already full school program 
certain peculiarities and odd happenings dur- 
ing the testing, which under other circun- 
stances would have been promptly noted, 
might easily, it was thought, have escaped 
the notice of the examiner. It was, there- 
fore, not until this later date that one of 
these happenings was recalled and that its 
possible relation to the data was appreciat- 
ed. Then the examiner remembered a_ state~- 
ment which was made by the teacher to the 
pupils just as the test papers were being 
distributed: "Now, children, don't forget 
what I have told you about working these ex- 
amples." 

The school had already been closed for 
the summer for several weeks, and it was nec- 
essary to communicate with the teacher by 
mail. The following letter was sent to her; 








> 


I am sure you will recall the two oc- 
casions on which I did some work in common 
fractions with the children of your grade. 


I am writing now to secure from you 
certain information which may be of consid- 
erable importance in interpreting one very 
interesting thing we found in the work of 
your pupils. What this is, I cannot now 
tell you, for you might be influenced there- 
by in answering the question I will ask you. 
I can, however, assure you that it is noth- 
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received from the teacher: 











ing to the discredit of yourself or of your 
pupils. 






Just as the children began to take the 
test in fractions for the second time (in 
May) I remember your saying to them, "Now, 
children, don't forget what I told you about 
working these examples." Perhaps the words 
are not exact, but the above conveys their 
general meaning. Now, what I want to know 
is, what was it that you were telling them 
to do? You apparently had something defi- 
nite in mind, and you knew that the chil- 
dren would understand to what you referred. 












Won't you please write me a note on 
the reverse of this letter and mail it to 
me at once in the self-addressed and 
stamped envelope I am enclosing? Just 
tell me what you meant by your statement. 
If you can't recall, then tell me that. 
You may take my word for the fact that I 
am not asking an idle question, and that I 
would not be bothering you after your many 
courtesies if it were not important. Your 
answer cannot possibly be used against you, 
even should I be inclined so to use it. 











Very truly yours, 
etc. 






Three days later the following note was 






Dear Dr. Brownell: 






In regard to your letter referring to 
the last test you gave my children, I shall 
be glad to tell you what I meant when I 
said, "Now, children, keep in mind what I 
have told you about working these examples." 








What I had reference to was this--for 
them to show you exactly how they made the 
fractions similar by multiplying both terms 
by the same number. Still another thing I 
had in mind was for them to notice those 
fractions that were already similar and to 
simply work them as they were. 













I hope that I have made this clear to 
you. If I have not, please notify me, and 
I shall be glad to clear this up in some 
way. 







I should like to know just how the 
children came out in the last test, or 
rather both of the tests. 







Sincerely yours, 
etc. 







The reader's attention is directed in 
particular to the second paragraph in the 
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teacher's letter, in which the children were 
being warned to "show exactly how they made 
the fractions similar by multiplying both 
terms by the same number.” The teacher's 
direction could have meant but one thing to 
these children who had been required in their 
daily work with fractions to employ the 
"crutch": they were being told to use the 
"crutch" on the test. And this they did, 
not because they had not outgrown the need 
of it, nor because they were not bright 
enough or mentally mature enough to get 
along without it, but because they were ac- 
customed to doing as they were told to do. 

Conclusion to (a). The teacher's let- 
t-r constituted the final crucial bit of 
evidence in the case: it accoumted for the one 
exception which had blocked an earlier an- 
swer to the question which was being inves- 
tigated. A curious circumstance which had 
little connection with the "crutch" as such 
had prevented the children on the re-test 
from showing the freedom from the form which 
it is probable they had developed; and as a 
consequence this one curious circumstance had 
postponec the end of the inquiry and had made 
necessary many excursions and side-trips 
which, but for this circumstance, would nev- 
er have been made. 

But now the data with respect to the 
persistence of the "crutch" were all in, and 
an answer had been obtained to the first 
question proposed for study. In the case 
of the children investigated one may say 
with assurance that the "crutch" had not be- 
come fixed as a permanent habit; rather that 
it tended to drop out of their fraction hab- 
its when other simpler procedures had been 
found. On this one point the device meets 
ene test which is raised by the opponents of 
such elaborate instructional forms; it meets 
this one test, but it does not thereby prove 
anything regarding its value or its contribu- 
tion to sound economical learning. 





(b) Types of faulty procedures in adding 
proper fractions associated with 
use of the "crutch" 


A second kind of objection. But those 
who oppose "crutches" would properly argue 
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that their objections to these devices are 
several in number. They might concede the 
validity and adequacy of the evidence ad- 
duced regarding the persistence of the par- 
ticular device here under investigation. 
They might grant that this "crutch" did 

not tend to get itself permanently and in- 
eradicably into the behavior of its users, 
and still they might insist that the form 
should not be taught. They could argue, for 
example, that the "crutch" would certainly 
confuse rather than clarify the process of 
adding fractions; that it would block learn- 
ing by encouraging children to remain ata 
lower level of efficiency and maturity than 
they would attain without it. Thus, this 
"crutch" might involve children in related 
roundabout and erroneous methods of think- 
ing, and in turn these methods, employed to 
excess, might then interfere with the devel- 
opment of skill and understanding in adding 
fractions. Specifically, use of the "crutch" 
might lead to habitual failure to reduce an- 
swers, to long or incorrect procedures in 
securing common denominators, and so on. If 
the device tended to produce these effects 
in children, they would maintain, it cer- 
tainly should not be taught them. 

Such an argument cannot lightly be 
brushed aside as irrelevant and immaterial, 
Neither, in the instance of the particular 
"crutch" here being studied, could the argu- 
ment be disregarded as inappropriate. On 
the contrary, that the "crutch" might actu- 
ally be open to the objection seemed more 
than possible in view of the elaborateness 
of the form. There was, clearly, just one 
thing to do, namely, to test the soundness 
of the objection by resorting again to ob=- 
jective, quantitative evaluation. 

Nature of the data. Fortunately the 
right sort of data were available. As has 
been previously explained, the object of the 
investigation of which the study here re= 
ported was a partwas to secure a list of 
pupils! errors and difficulties in adding 
proper fractions. These errors and diffi- 
culties were collected by means of confer- 
ences which were held individually with all 
of the 245 children in schools 1-7 at the 
time they were given the fraction test. 








1. These interviews required nearly 70 hours of time. They could have been made only through the assistance of others, 
This assistance was provided by four graduate students enrolled in the writer's course, "Research in Arithmetic." It 


is a pleasure to give the credit which is due these four: 


Hodges, and Mr. J. G. Garrett. 


Mrs. Flora Gillentine, Miss Virginia Howard, Miss Myra 
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The resulting list of "faulty procedures" or 
"faults" as they will hereafter be designat- 
ed originally comprised a total of 55.1 Six 
major categories were recognized, these be- 
ing "Faults in finding common denominators" 
(of which, for example, there were ten spe- 
cific types and a miscellaneous group which 
occurred only rarely), "Faults in operating 
on denominators", "Faults in operating on 
numerators", "Faults in reducing fractions 
to lowest terms", "Faults in obtaining whole 
numbers in reducing improper fractions to 
mixed numbers", and a group of three "Mis- 
cellaneous faults" which applied to proce- 
dures relating to the example as a whole. 
It was these data, carefully classified, 
which, it was thought, provided the best ba- 
sis for evaluating the second objection to 
the "crutch." 

Complications. ‘The theory here was 
that if use of the "crutch" did really in- 
terfere with learning, this effect should be 
discernible in the degree to which children 
taught touse the "crutch" also made use of 
the faulty procedures. ‘Ihus, if the "crutch" 
children used a greater variety of faulty 
procedures, or used them more generally, or 
used them oftener, then the weight of the 
argument would be against the "crutch." his 
line of thinking suggested the method of at- 
tacking the problem, which was to compare the 
occurrence of faults in the work of the 80 
children in the two "crutch" schools with 
their occurrence in the work of the 165 chil- 
dren in the five "non-crutch" schools. 

These comparisons, which at the outset 
appeared to be simple in nature and few in 
number, became increasingly complicated and 
numerous. As had been said, it had been de- 
cided to study the two groups of children 
with regard to their comparative use of faulty 
procedures. But the phrase "comparative use" 
proved soon to be much more easily repeated 
than interpreted. For example, how was 
“comparative use” to be measured? The first 
measure which came to mind, “number of ap- 
pearances", was found to be wholly unsatis- 
factory, for it became clear that it made 
much difference how and where the fault ap- 
peared. Thus, a frequency of 10 meant one 
thing when it represented the number of times 
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|@ given fault was employed by a single pu- 
| pil and it meant another thing when it stood 
|for the total use by eight or ten pupils. 
The distinction to which attention is 
| Called in the last sentence is one which has 
|More than statistical significance; it has 
|much significance for instruction when its 
—— are understood. When each 
child in a class of 45 fails one or more 
times to reduce an improper fraction on a 
test, making this error a total of 75 times, 
the teacher recognizes the need of remedial 
instruction for the class as a whole. When, 
however, the total of 75 failures to reduce 
is confined to five or ten of the pupils, 
the teacher has quite a different task; now 
she proceeds through individual instruction, 

The comparisons to be made in this 
| study had, therefore, to take account both 
of the number of times faulty procedures 
were used (frequency) and of the number of 
children who used them (universality). The 
influence of the "crutch" on learning could 
not properly be evaluated by considering 
only one of these two measures of "compara- 
tive occurrence." 

A second group of difficulties arose in 
connection with determining the faulty proce- 
dures, use of which was to constitute the 
| basis for evaluation. If the comparisons 
were restricted only to specific faults, 
then something might be lost. For instance, 
the "crutch" children might differ markedly 
from the “non-crutch" children, not so much 
in their use of certain faulty procedures, 
as in the proportionate number who used 
faulty procedures regardless of type, or in 
the number of times they used all faulty 
procedures combined, or in the variety of 
faults to which they were subject. Steps 
had to be taken to assure that measures were 
obtained at all these points. 

Nor were the complications yet ended. 
There remained the question of the trust- 
worthiness of whatever differences were 
founa. The simple process of subtracting 
the number of uses for the one group from 
the number of uses for the other group did 
of course yield a difference, tut this dif- 
ference itself demanded some study. After 
all, the "non-crutch" children outnumbered 














1. The list is omitted from this report for two reasons. 


In the first place, it would require too much space. 
second place, the purpose of the report does not require detailed descriptions of all the faults. 


In the 
Such information 


will be provided only for those faults which for one reason or another need explanation. 
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the "crutch" children more than two to one, |the solid line for the "crutch" pupils and 
and on this account comparisons could not be | the dotted line for the "non-crutch." It 
directly made as to frequency or universali- |will be observed that the curves are markedly 
ty of use unless appropriate corrections similar in shape and that they parallel each 
were made. Furthermore, the “non-crutch" other rather closely. As measured by this 
children represented enrollment in five |method, then, there was little difference be- 
schools; the “crutch” children in two | tween the two groups in the amount of their 
schools. Differences in faults as between faults. 
the two groups of children could not be as- But the limiting phrase "as measured by 
cribed to instruction or non-instruction in | this method" proved to be a highly important 
the "crutch" until due consideration had lone. The distribution table and Figure 1 did 
been given to the possible influence of this | show general agreement in the use of faulty 
last named factor. In other words, differ- | procedures, but being based upon gross meas- 
ences which were not reliable differences | ures, they could, it was clear, conceal real 
contained little of worth for the purposes Pprccespagesovi For example, the distribution 
of evaluation. | table revealed nothing regarding the relative 
Use of faulty procedures. A beginning | number of different kinds of faults to which 
was made by noting the relative number of | the two groups were subject. Evidence on 
times the "crutch" and the "non-crutch"chil-| this point was essential to a complete evalu- 
dren employed faulty procedures. As a first| ation of the effect of the "crutch" upon 
step the children in the two groups were | learning. The first count seemed to indicate 
separately arranged in order according to | that the "crutch" children (perhaps because 
the total number of faults which appeared | of the “crutch") employed a smaller number of 
on their papers. From this array frequency | different faulty procedures, their total be- 
distributions were made (Part II, Table V) | ing &% as compared with 49 for the "non- 
to show the number of children in the two /crutch" children, Corrections (described in 
groups who had made use of faulty proce- Part II) to remove the influence of size of 
dures certain numbers of times. ‘ihe fre- group and of number of schools represented 
quencies were then transmuted to per cents materially reduced the number of the "“non- 
in order to make allowance for the differ- crutch" group to 36 or less. Thus the prelim- 
ence in size of the groups. Finally the inary impression that the "crutch" had possi- 
two per cent distributions were plotted in | bly performed a useful service in preventing 
curves in order to permit a direct compari- | wide variation in individual practice had to 
son of the "crutch" and the "non-crutch" be modified, and again evidence was found to 
children regarding their use of faulty pro=- | be lacking of any large amount of dissimilar- 
cedures. Figure 1 contains the two curves, | ity in the degree to which "crutch" and "non- 
crutch" children made use of faulty proce- 















24% dures. 
—— Further study, however, produced other 
Per cent measures which, when applied, led again to 
of group }, "crutch changes in the evaluation of the worth of the 
16 1 ——— —s group "crutch." Neither of the two kinds of com 
"non-crutch" parisons which thus far had been made had re- 
4 filed sulted in a single precise measure of the 
eid relative use of faulty procedures. For this 
purpose the obvious measure seemed to be an 
average. It was found that the "crutch" chil- 
 * dren on the average (median) had used faulty 
. » Lae procedures 11.5 times and the "non-crutch" 





children, 12.5 times. When the types of 
faults on which the comparison was based were 
reduced to the 28 kinds which were found in 
common in the work of both groups, the aver- 
ages became 11.5 and 10.5, respectively, for 


0 8 16 24 32 40 4a 
Times used 


Figure 1. Per cents of the 
two groups using faulty procedures 
indicated number of times. 
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"crutch" and “non-crutch" children. It now 
pegan to look as if the "crutch" instead of 
shielding children from faults might actual- 
ly have exposed them excessively to faults. 

Another kind of average was tried out. 
Whereas the first average was a measure of 
the number of times faulty procedures were 
employed regardless of the number of chil- 
dren who used them, this second average was 
specifically designed to take account of the 
number of users in each group. A distribu- 
tion was first made for each group in such 
a way as to show the number of pupils who 
had employed each of the various faulty pro- 
cedures. For these two distributions aver- 
ages (medians) were computed to give the 
average number of pupils in each group who 
had used the different faulty procedures, 
hese averages were then changed to per 
cents of each group in order to correct for 
differences in size. The resulting measure 
of 3.6% for the "crutch" group is inter- 
preted thus: on the average the different 
faults of the "crutch" children appeared on 
the papers of 3.6% of the members of that 
group. The corresponding measure for the 
"non-crutch" group was 2.1%. The advantage 
of 1.5% for the “non-crutch" children re- 
mained when only the 28 common types of 
faults were included in the comparison (medi- 
an for “crutch” children 4.8%, for "non- 
crutch" 3.3%). 

The nature of the final conclusion to 
be drawn from the comparison of the two 
groups on their use of faulty procedures 
thus depended upon which of the measures 
developed seemed to be most valid. On the 
basis of the distribution table and of an 
analysis of the variety of faults character- 
istic of thé two groups little difference 
could be found, and the conclusion would be 
that the "crutch" apparently did not tend to 
involve its users in faulty procedures. On 
the basis of the averages (number of times 
faults appeared and number of children sub- 
ject to faults) the "non-crutch" children 
seemed to be very slightly superior (though 
not reliably so), and the conclusion would 
be that the "crutch" had been a source of 
possible harm, 

Useof particular types of fault. Thus 
far, conclusions were available only from 
comparisons which neglected the nature of 
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_faulty procedures. 
_from the first been the intention to do so, 
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Even if there had not 


| the conflicting conclusions from the general 
| comparisons would have made it imperative to 
|study possible differences in the faults 

| which were characteristic of the two groups 


of children. The next step seemed to be to 


| compare the “crutch" and the "non-crutch" 
| pupils as to the manner in which they had 
| employed particular kinds of faulty proce- 


dure. 
An examination of the data regarding 


| the use of the 55 faults found on the papers 


of the 245 children at once revealed the fact 
| that many could be eliminated from further 
consideration because of their infrequent oc- 
currence. Two very lenient criteria, the 
application of which could not possibly lose 
anything of importance, were adopted as a 
|means of selecting the particular faults 
which would be studied. In order to be in- 
_Cluded a given fault had to satisfy at least 
one of two sets of conditions: first, an 
average frequency of use of .20 times in one 
or both groups of children; second, a use 
of 7.5% by the children in one or both groups. 
| A total of 32 faults failed to meet these re- 
quirements, and the comparison of the groups 
was then restricted to the 23 types of faults 
which remained. 

The first test of different usage in the 
groups was that of relative number who had 
employed the separate faults (Part II, Table 
VI). For only five faults, here simply desig- 
nated as faults D, G, M, U, and V, did there 
seem to be even a possibility of difference. 

The second test was that of the average 
| number of times the particular faulty proce- 
| dures were used (Part II, Table VII). On 
| 





this basis none of the previously rejected 
eighteen could be re-claimed for study as 
representing differences between the groups, 
nor could any one of the five which had met 
the first test meet the second as well. 
A third test was then applied, a test 

— carried the study into the separate 

| Schools as a means of determining whether 

| the five faults which had gotten by the first 
/hurdle were actually characteristic of the 
group which was found to employ it more com 
monly. Faults D, M, and V were more popular 
among the "non-crutch" children. Faulty 
procedure D was found in use in only three 





| 
| 








l. As a matter of fact the differences of 1.5% were not sta 


tistically significant, as shown in Part II. 














of the five “non-crutch" schools. Its use 
was therefore not typical of "non-crutch" 
children, and it could not be accepted as a 
real difference between "non-crutch" and 
"crutch" children. Application of the same 
test showed that none of the other four 
faults (G, M, U, and V) was genuinely pecul- 
iar to the group in which it was found more 
commonly. 

There remained a fourth test, the most 
crucial one, which would have been applied if 
this had been necessary, namely, a test of 
the functional relationship of the fault to 
use or non-use of the “crutch." The first 
three tests were purely statistical; they 
could do no more than show whether a given 
fault appeared more or less frequently among 
the “crutch” or the “non-crutch" children. 
Conceivably a faulty procedure might have 
been accidentally associated with the use of 
the "crutch" without its being caused by the 
"crutch" or even closely related to it. . In 
other words, if a given faulty procedure had 
met the first three tests it would still have 
had to pass this one of functional connec- 
tion. But no faulty procedure managed to 
get by the first three tests, 

Conclusion to (b). The weight of the 
evidence, as obtained from the preponderant 
number of comparisons attempted, was that the 
“crutch” as it was employed in schools 1 and 
2 represented a set of habits which were 
self-contained and relatively autonomous, a 
system which allied itself with or remained 
independent of other sets of habits, such as 
those involved in faulty procedures, as the 
circumstances of learning dictated. Certainly 
there was in the various sorts of analyses 
attempted little to support the contention 
that use of the "crutch" confused children 
and indirectly impeded their development of 
the ability to deal with proper fractions. 





(c) Relationship of use of the "crutch" 
to accuracy in adding proper 
fractions 


A third objection. Still, the opponent 
of "crutches" might not be convinced that 
children could without serious consequences 
be taught to add proper fractions through the 
agency of the elaborate form employed for 
that purpose in schools 1 and 2. This device 
might not become fixed as a permanent habit, 
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and it might not interfere with learning in 
the manner in which its effect in this re- 
‘gard was here studied. But, he might con- 
‘tend, the "crutch" might exert its dire in- 
|fluence in some other way which would reveal 
|i{tself in lowered ability to add fractions, 

Here was another challenge, a vital 
|test, which had to be met by the "crutch." 
| Did the children who had been taught to em 
| ploy the form fail to develop satisfactorily 
| the ability to add proper fractions? Again 
| data were at hand for an evaluation of the 
| objection, this time in the scores on the 
fraction test. 

Comparison of test scores. Test scores 
made by “crutch" and by "non-crutch" chil- 
dren were arranged in frequency distribu- 
tions, and the median scores were calculated 
for the two groups (Part II, Table VIII). 
The average score for the "crutch" children 
was found to be 6.9 and that for the "non- 
crutch" children, 7.3. The difference, 
though too small to be taken seriously, was 
in favor of the “non-crutch" group, and on 
this account seemed to support the position 
of the opponents of "crutches." 

A check was made on this conclusion by 
studying the comparative scores of the groups 
in a different way. Credit was allowed for 
answers to examples which were correct ex- 
cept for failure to reduce answers to lowest 
terms, and the new test scores were arranged 
in distributions (Part II, Table IX). The 
median for the "crutch" children rose to 
12.8 while that for the "non-crutch" group 
rose to 14.2. Here was an even larger dif- 
ference in favor of the “non-crutch" chil- 
dren, and furthermore a difference too 
large to be disregarded as negligible in 
amount. Something apparently had happened 
in the change from the one method of scor- 
ing to the other which had proved disastrous 
to the "crutch" children. 

An attempt to account for differences. 
A search was instituted to isolate this dis- 
turbing factor, not merely in the hope of 
accounting for the loss suffered by the 
"crutch" children, but more in the hope that 
through the search something might be dis- 
covered concerning the relationship between 
use and non-use of the "crutch" and effi- 
ciency in adding fractions. 

A comparison was first made of the rel- 











ative success of the two groups of children 
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on the three types of example in the frac- 
tion test. No difference was observable 
in the case of examples 1-5 (common denon 
{nator in both fractions) or in the case of 
examples 10-15 (common denominator not giv- 
en); but important differences could be de- 
tected in the case of examples 6-9 (common 
denominator in one fraction). Here the 
"crutch" children were distinctly inferior. 
Just why the "crutch" children should 
have been so markedly less successful with 
these four examples was not at all clear, 
but it was even less clear that their rela- 
tive loss in efficiency was to be traced to 
the influence of the "crutch." An explana- 
tion was first sought in the comparative 
manner in which these children employed 
faulty procedures in the three types of ex- 
ample. Two facts were uncovered: first, 
these children employed a greater variety of 
faulty procedures in these examples and, sec- 
ond, they employed them more frequently. 
They were especially prone to add the given 
denominators in order to secure the common 
denominator (instead of taking the common 
denominator as it was given in one fraction) 
and to make numerous errors in treating the 
numerators (being especially apt to add the 
numerators as given without changing them in 
terms of the common denominator). These 
facts were undeniable, but at the same time 
they failed entirely to require as their ex- 
planation some property of the "crutch" which 
necessarily brought about the situation as it 
was found to exist in the two schools which 
provided instruction in that device. 
Conclusion to (c). The third of the 
three objections to "crutches" seemed, in 
the case of the particular "crutch" under 
observation, to be the soundest. The "non- 
crutch" children were, on the average, supe- 
rior to the "crutch" children in adding prop- 
er fractions, although the amount of this 
superiority was too slight to meet the usual 
statistical requirements for reliability. 
Differences of an instructional sort other 
than those directly associated with the 
teaching of the "crutch" might explain the 
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| “crutch” is not very strong. 
_ three objections to "crutches" in general 
| have been raised against the particular one 
_in use in schools 1 and 2, and none of these 
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superiority of the "non-crutch" children ful- 
ly as well as the fact that they did not em 
ploy the device in adding fractions. These 
other explanations for the difference, which 
were not here canvassed, seemed the more 


| reasonable since no functional relation 
| could be discovered between the nature of 


the "crutch" on the one hand and the spe- 
cial aspects of the ability to add fractions 


| in which the "crutch" children were the 
| weakest. 


SUMMARY OF PART I 


Taken as a whole the case against the 
One by one, 


objections was found to be valid. Failure 
to discover reasons not to teach the "crutch" 


| may readily be interpreted to signify that 
| the "crutch" should be taught. 


Such an in- 
ference is, however, unwarranted. On the 
contrary, the whole study is partidl and, in 
one sense of the word, negative. A great 
many aspects of the problem have not even 
been touched upon, and the ones which have 
received attention are those which had their 
origin in possible weaknesses of the device, 
Before a recommendation could be given this 
"crutch", it would be necessary to establish 
three facts: first, that it does not sub- 
ject children to harmful influences other 
than those here considered; second, that it 
possesses distinct advantages which facili- 
tate learning; and third, that it is at 
least as effective as other available meth- 
ods of accomplishing the same end. The pur- 
pose of the study was not, however, to prove 


_the worth of the "crutch", but rather to 


evaluate the device in so far as was feasi- 
ble in terms of the available data. In oth- 
er words the study is illustrative rather 


| than final; it is an example of an attempt 
40 substitute quantitative research for opin-~ 
| ion in a field in which opinion still is too 


commonly the determining factor,?+ 





1. The implications of the study for the psychology of learning and for the teaching of arithmetic are set forth in an 
article entitled "The Place of 'Crutches' in Instruction" soon to appear in the Elementary School Journal. 























Part II 
STATISTICAL EVIDENCE 


Purpose and nature of Part II. The in- 





troductory material preceding Part I informed | numbers of times. 


the reader of the purpose of the study as 
that of evaluating an arithemtic "crutch" 
from three points of view, reported the sub- 
jects to be 245 fifth grade children enrolled 
in seven schools, described the test onwhich 
data were secured, and illustrated the form 
of the "crutch" in connection with one ex- 
ample. Part I traced the course of the in- 
vestigation, setting forth in detail the 
steps taken and giving the reasons therefor. 
In Part II very little of the account of the 
study will be repeated, only as much as may 
be necessary in presenting some of the basic 
data and in explaining such measures and in- 
terpretetions as need clarification. 

Certain definite limitations attach to 
the study as a whole. Chief among these is 
that fact that, due to the nature of the 
data collected and to the conditions under 
which they were obtained, the method of eval- 
uation is largely statistical and analytical. 
For the adequate evaluation of the "crutch"a 
control-group experiment would have been mich 
superior since it would have permitted imme- 
diate management of the factors which con- 
tributed the data. Under these circumstanc- 
es it would have been possible to keep the 
“crutch” under direct observation and tonote 
the character of its relation to learning in 
the process of the learning. ‘The method of 
the present study is, by contrast, an ex post 
facto approach. ‘that is to say, it repre- 
sents a survey made after the factors had 
operated to produce certain results which 
then had to be scrutinized to detect possi- 
ble relationships between factors and data. 
Yet, while these limitations mst be frankly 
admitted, they do not by any means invalidate 
the findings. On the contrary, the conclu- 
sions are advanced with considerable confi- 
dence in their trustworthiness. 


(a) Persistence of the "crutch." The 





first of the three separate studies, that 
dealing with the persistence of the "crutch", 
was confined to the pupils of schools 1 and 
2. On the application of the fraction test 
to the 35 children in school 1 during the 
last week of January, 1930, it was found 
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'that these pupils used the "crutch" verying 
Nine did not use it at 
|all; thirteen used it 10 times; and the re- 
|maining thirteen used it 15 times, in all of 
| the examples. The manner in which use of 
the “crutch” varied within the group of 35 
' children suggested the hypothesis that they 
|had tended to neglect and eventually to dis- 
| card the "crutch" as it ceased to serve any 
| purpose. In an attempt to check this hy- 
| pothesis, as well as to secure further light 
Peace nature of the persistence of the form, 
a number of measures were collected of both 
penne and arithmetic ability. These meas- 
ures for the 31 of the 35 children who took 
all the tests have been assembled in ‘lable I. 
This table is read as follows: Subject 1 
belongs to the group of children who did not 
use the "crutch" on the fraction test. His 
age was 142 months, his mental age 129 months, 
and his I.Q. 83. During the preceding year 
of two terms he had averaged 57.5 in arith- 
metic, according to his teacher's marks. He 
secured a score of 14 on the arithmetic prob- 
'lem-solving test in the National Intelligence 
‘'est (which was given to secure measures of 
M.A. and I.Q.). On the fraction test he 
solved five of the fifteen examples correct- 
ly and made the same score when credit was 
allowed for unreduced answers. The means 
‘for the three sub-groups of children are 
| given in the rows following the three parts 
of the table. ‘ihus, the means for the 0- 
group were 142.0 months in C.A., 140.5 
months in M.A., and so on. 

The separate items in Table I (for ex- 
ample, the C.A. of 139 for Pupil 31) possess 
little meaning; they are therefore in a form 
which is hardly conducive to understanding 
and interpretation. These data have, conse=- 
quently, been re-worked with the results 
which appear in Table II. The procedure 
followed in deriving this table requires ex- 
planation. First, the 31 children, regard- 
less of the number of times they employed 
the "crutch", were ranked with respect to 
the one trait of C.A. and were then divided 
into five groups of approximately equal num 
ber. ‘Thus, there were six children (pupils 








2, 6, 42, 61, 65, 66) whose ages fell within 
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TABLE I 


DATA CONCERNING PUPILS OF SCHOOL 1, WHO USED FRACTION "CRUTCH" 
VARYING NUMBERS OF TIMES IN TEST 1 
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Measures of Arithmetic Ability 
Frequency of General Measures Average of N.I.T. Problem- Fraction Test 
Pupil | Use of the C.A. M.A. I.Q. Teachers' Marks | Solving Score | Number Ex- | Number Right 
*Crutch" in in in actly Right Except 
Mos. Mos. Mos. Reduction 
(1) (2) (3) (4) (5) (6) (7) (8) (9) 
1 0 142 129 83 57.58 14 5 5 
2 1@) 158 152 98 59 14 1 1 
3 0 140 138 99 57* 14 3 5 
4 0 132 151 114 89.5 20 6 6 
5 0 128 138 | 110 78* 16 9 15 
6 0 160 | 152 95 86” 12 9 15 
7 (@) 144 131 91 69 12 6 15 
8 ie) 132 133 101 81 14 10 12 
Mean 142.0 | 140.5; 98.8 72.2 14.5 6.1 9.3 
31 10 139 160 115 85.5 16 10 10 
32 10 130 181 139 94 18 10 11 
33 10 148 149 101 97 22 10 10 
34 10 143 157 109 87.5 20 15 15 
35 10 120 141 117 80.5 18 9 13 
36 10 132 130 98 84 10 12 15 
37 10 144 137 95 54 14 6 15 
38 10 133 122 91 79 16 11 11 
39 10 144 146 101 79 14 9 11 
40 10 132 138 104 70 12 10 14 
41 10 120 147 132 sad 14 12 12 
42 11 152 137 90 62 6 “4 10 
Mean 136.4 | 145.3] 107.7 79.1 15.0 10.3 12.3 
61 15 161 138 85 84 16 14 15 
62 15 132 132 100 69 16 7 15 
63 15 120 159 132 89 18 12 13 
64 15 136 136 100 76 20 + 4 
65 15 149 150 101 64 16 8 10 
66 15 170 “129 76 60 10 5 11 
67 15 144 | 114 79 70* 12 7 15 
68 15 132 129 98 82 12 7 15 
69 15 120 132 110 88 8 13 14 
70 15 145 146 101 60 12 6 15 
71 15 ize | 154 | 113 93” 16 12 15 
Mean 140.5] 138.1} 99.5 75.9 14.0 9.1 13.1 






































"Teachers' marks thus indicated represent one term's work; all others are based on the average of two 


terms' work in arithmetic. 


the limits of 149-70 months; these were the 
oldest children and are here included in the 


first or highest fifth. 


Six other children 
(pupils 7, 33, 37, 39, 67, 70), whose ages 
lay between 144-48 months, constitute the 





second fifth, and so on. 
I.Q., average mark in arithmetic, etc., were 
all separately transmuted in a similar man- 
ner, and the 31 children were divided into 
approximately equal fifths on each scale. 


The data for M.A., 





1. This division into fifths was, as stated, only approximate. 
fifth was 6.1 (51 divided by 5). 
or more than 7, seven of these being 5. 
the last set of three columns. 


The theoretic number which should have appeared in each 
Actually in only eight of the 55 fifths in Table II are there less than 6 individuals 
One important variation from the practice of dividing into fifths appears in 


Twelve of the 51 children made perfect scores on the fraction test when credit was 
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COMPARISONS OF THREE GROUPS OF USERS OF THE "CRUTCH" WITH 
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RESPECT TO CERTAIN MEASURES OF DEVELOPMENT AND ARITHMETIC ABILITY 





Score-Fraction Test 











Fifths of Pupils with | Average | Score in | Answers Answers Right 
Respect to Trait Named C.A. | M.A. I.Q. Mark in Arith., Entirely | Except Reduc. 
at Head of Each Set of | Arith. N.I.T. Right 
Three Columns ne 
15 10 0 | 15 100 |15100 | 15100/15100 |15100 15 10 0 
First (highest) 312/e232/]1 41 321] e2 41 5 30 





Second 2 61 : 2s S @a 


2 61 4 21 0 51 





Third (middle) 8 23 i es 421 


1 42 0 34 0 32 2 31 





Fourth 





3 2 





3 





2 








Fifth (lowest) 231 321 322 


2123 6 20 2 08 104 





Total 














11 12 8 














Table II consequently shows the distribution 
of the 31 children, first, according to their 
position in the various fifths for the dif- 
ferent traits (C.A., M.A., etc.), and second, 
according to the number of times they used 
the "crutch" in the fraction test (15, 10, 
and 0). Thus, one notes from the table that 
of the oldest (highest) fifth, three pupils 
used the "crutch" 15 times; one used it 

10 times, and two used it not at all. Of the 
seven most mature mentally (M.A.), two used 
the "crutch" 15 times, three 10 times, and 
two O times. 

With the data in this form it was now 
possible to study correspondence between any 
of the separate measures and the number of 
times the "crutch" was used on the test. For 
example, any marked relation among these 31 
children between age-level and continued use 
of the "crutch" would show itself in a ten- 
dency for the data to be concentrated in cer- 
tain sections of the part of the table which 
is devoted to C.A. Thus, if the older chil- 
dren were the ones first to abandon the use 
of the "crutch" that condition would be re- 
vealed by the presence of a disproportionate- 
ly large number of individuals in the upper 
right and the lower left corners of the sec- 
tion and by a disproportionately small num 





ber of individuals in the upper left and low- 





er right corners. On the other hand, if the 
reverse of this tendency obtained and the 
Older children tended to rely increasingly 
on the "crutch", this condition would be 
evidenced by a clustering of cases in the 
upper left and the lower right corners and 
by a lack of cases in the upper right and 
the lower left corners. 

There is no marked concentration in any 
part of the C.4. section of Table II. On the 
contrary, the cases are fairly evenly dis- 
tributed, indicating the absence of any type 
of relationship between C.A. and persistence 
in the use of the "crutch." If the table 
is read vertically by colums, instead of 
horizontally by rows, the same conclusion is 
seen to be the only tenable one. Thus, of 
the eleven children who used the "crutch" 15 
times three were among the oldest six in the 
grade, two were in the second fifth, and two 
also were in each of the three lowest, or 
youngest,fifths. The cases in the colums 
for the 10- and the O-groups are similarly 
distributed. 

The method of reading the data for M.A. 
and I.Q. in Table II is the same as that for 
C.A. Relationship between persistence in 
the use of the "crutch" and these items of 
general ability would be similarly manifest- 
ed by the distribution of cases within the 



















(Footnote continued) 


allowed for unreduced answers, and it was therefore necessary to call them the two highest fifths, no difference be- 


tween them being recognized. 
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appropriate parts of the sections. With re- 
spect to both M.A. and I.Q., as was the case 
with respect to C.A., the 31 children are 
rather uniformly scattered and the conclusion 
with regard to them is like that for C.A. in 
being negative. In this group of children 
there was no marked association between meas- 
ures of these traits and continued use of the 
form; in other words, the "crutch" persisted 
or disappeared from use quite apart from the 
single fact of differences in mental maturity 
or in brightness. 

The four measures of arithmetic ability 
in Table II represent three distinct aspects 
of the total ability. The first (average 
mark in arithmetic) is the most general. It 
represents the teacher's evaluation of the 
year's standing tof each child in all those 
arithmetical processes which were taught in 
grade IV, and hence does not include ability 
to add proper fractions. The second measure 
is based on the problem-solving test in the 
National Intelligence Test. ‘Iihese scores, 
therefore, refer to a particular arithmetic 
ability rather than to the composite compre- 
hended in teachers! marks and refer, further- 
more, to an ability which is supposedly quite 
different from that involved in adding frac- 
tions. The last two measures were obtained 
from the fraction test and are therefore 
measures of the specific ability under in- 
vestigation. 

The data for the first two measures of 
arithmetic ability may be dismissed with the 
brief statement that they reveal no explana- 
tion at all for continued use of the "crutch." 
The last two measures deserve more detailed 
consideration because of their being measures 
of the ability to add fractions. When credit 
was allowed only for completely correct an- 
swers, five of the fourteen who made the 
highest scores on the fraction test used the 
"crutch" 15 times and thirteen of them 10 or 
more times. On the other hand, six of the 
twelve who made the lowest scores used the 
form 15 times, and seven, 10 or more times. 
Excessive dependence on the device was, then, 
almost as apt to be associated with inability 
as with marked ability to add proper frac- 
tions. When credit was allowed for answers 
correct except for failure to reduce, a sim- 
ilar association was revealed. It is true 
that half of those in the highest two fifths 
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|on the fraction test used the “crutch” 15 
times and that only one-fourth (three out of 
thirteen) in the lowest two fifths in the 
test used the form this number of times, but 
this apparent difference is offset by a cor- 
responding reversal for those who used _ the 
form 10 times and by a fairly even distribu- 
tion of cases who did not use the "crutch" 
at all. 

The study thus far had been concerned 
with the relationship between retention of 
the "crutch" on the one hand and measures of 
C.A., M.A., I.Q., and of various aspects of 
arithmetic ability, each taken separately, 
on the other hand. The conclusion, that no 
one of these traits alone was closely asso- 
ciated with use or disuse of the form, might 
be true, at the same time that there might 
have been a relationship between persistence 
of the "crutch" and certain combinations of 
items. This hypothesis was tested in the 
case of two of the more promising combina- 
tions, namely, M.A. and general arithmetic 
ability, and I.Q. and general arithmetic 
ability. The method of study is illustrated 
in Table III, which is interpreted as fol- 
lows: the seven children who were most ma- 
ture mentally (highest fifth, M.A. 152-81 
months) are distributed according to their 
standing in general arithmetic ability and 
according to their use of the “crutch.” Thus, 
| there were three of the seven who were also 
| in the highest fifth of arithmetic ability. 
Two of the three used the form 15 times, and 
the other one, 10 times. There were also 
three children who ranked in the highest 
fifth in M.A., in the second highest fifth 
in arithmetic ability. Of these, two used 
the form 10 times, and the other O times. 
The last of the seven most mature mentally 
was in the poorest fifth in arithmetic, and 
he made no use of the form. The data for 
the other fifths in M.A. are read in similar 
fashion. An analysis of the data in the 
sections for the separate fifths of the 
table is space-consuming and may easily be 
made by the reader himself. In some cases 
(for example, in the data for the highest 
fifth in which excessive use of the "crutch" 
seems to be related both to high M.A. and to 
high arithmetic ability) relations might be 
found, but these relations would not hold in 











1. Except in the case of six children (see Table I) for whom the records showed marks for only one of the two items, 
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Fifth in 
Fifth in Arith. 
M.A. (Teachers' Marks) 





Frequency of Use of the "Crutch" 


Total 
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First 88 - 97 

First Second 82 - 87 
(highest) Third 76 - 81 
152-81 months Fourth 62 - 70 
Fifth 54 - 60 
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other fifths of the table (for example, ex- 
cessive use of the "crutch" is also charac- 
teristic of those who were low in M.A. and 
in arithmetic ability). The only conclu- 
sion to be had from a consideration of all 
31 cases is that whatever may have been the 
reason for their retaining or relinquishing 
use of the "crutch", that reason is not to 
be found in any peculiar combination of M.A. 
and general arithmetic ability. 

A table similar to Table III was con- 
structed to permit the study of the relation 
between [.Q. and general arithmetic ability, 
combined, and retention of the "crutch." 
This table, here omitted, likewise yielded 
negative results; persistence of the "crutch" 
was not to be accounted for in terms of some 
special combination of this pair of items. 





The fraction test was given to the 45 
children in grade V of school 2 during the 


first week of February, 1930. Table IV con- 
tains the data which were collected for these 
pupils. The table is divided into two parts, 
the upper half containing the measures for 
the children who did not employ the "crutch" 
and the lower half containing the measures 
for the children who used it 15 times. Un- 
like Table I for the children in school l, 
Table IV does not include the measures for 
the problem-solving test in the National In- 
telligence ‘test. 

No exhaustive analysis like that for 
school 1 was undertaken for school 2. Three 
reasons were responsible for this omission, 
In the first place, the school organization 
into upper and lower sections of the grade 
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DATA CONCERNING PUPILS WHO USED FRACTION "CRUTCH" 


TABLE IV 


VARYING NUMBERS OF TIMES IN TEST 1 





Pupil 


General Measures _ 


Measures of Arithmetic Ability 





Use of 
*"Crutch" 
Test 1 


C.A. 


iA." 
| 





Section B 
101 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 


a T 








|oococooCeCe eooooocoo°o°cdco 


| 
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Section C 
1314 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 


128 
126 
102 
134 
125 
134 
120 
126 
154 
127 
122 


122 
125 
132 
143 
118 
140 
121 
122 
148 
121 
130 


127. 


= 
O| 147.6 


7 Teachers’ 
T.Q. Marks 
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109 
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130 
152** 
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"M.A.'s and I.Q."s were obtained through the use of the National Intelligence Test, which was admin- 
istered by the writer. 


“These measures are estimated. 


These two children scored higher than this. 
#These Children failed of promotion the preceding quarter, and were required to repeat the work of 


the C-section. 


group as a whole. 


The highest score for which an M.A. is given is the score of 198. 


There scores are, on this account, omitted in calculating the measures for the C- 
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(B and C, respectively) which were separated 
by three months of schooling, paralleled and 
seemed to account for the difference in use 























of the "crutch." In the second place, there 
were no differenc: ithin either section 
with regard to the use of the form, a condi- 
tion which made such a study unnecessary, as 








it was necessary in the case of school l. In 
the third place, the number of children in 
the two sections would have been too small 
to have justified the types of distinction 
made in the case of school 1 even if such 
an analysis had been deemed advisable. It 
is, however, interesting to compare the 
means of the O-children (Besection) with the 
means of the 15-children (U-sectior). ‘he 
former group, 
were more than a year older mentally and werc 
considerably brighter (an average margin of 
more than seven points in I.Q.). They thus 
seem to have been selected for Section B 
largely on the basis of intellectual factors. 
In arithmetic ability in general and in 
scores on the fraction test they showed the 
superiority that would have been expected 
from their greater mental ability and from 
their larger amount of schooling. 

The rest of the search for the factors 































































































best detailed in the narrative of Part I. 
Statistical analysis played an exceedingly 
small role in that search. Solution of the 
problem came through simple comparisons, 
through the results of re-testing, and final- 
ly through conversation and correspondence 
with the classroom teachers. On this ac- 
count, there is no further reference in Part 
II to the first problem investigated. The 
following section of Part II is devoted to 
the second problem, namely, the degree to 
which use of the "crutch" interfered with 
learning to add fractions by involving its 
users in faulty procedures. 

(b) Use of faulty procedures. As ex- 
plained in section (b) of Part I, an attempt 
was next made to determine whether the 
"crutch" had interfered with the development 
of ability to add proper fractions. The 
means adopted to measure this interference 
with learning was to note whether use of the 
"crutch" was related in any close way to use 
of faulty procedures (errors, roundabout 

processes, etc.) in adding fractions. A list 
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though of the same average C.A,, 


underlying the persistence of the “crutch” is | 
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faults had been collected from 
subjects in the investigetion through 
conference with each child. 

In order to determine whether or not 
use of the “crutch” had led to an excessive 
dependence on faulty procedures in general 
or on some particuler faulty procedures the 
80 children in the two "crutch" schools were 

| compared with the 165 children in the five 
"non-crutch" schools. In other words the 
"non=crutch" schools served as a check group. 
They could serve this purpose only on the 
condition that the two groups of schools 
(and of children) were alike in all essen- 
tials except that of instruction in the 
"crutch." ‘there is considerable reason to 
believe that the requirement of similerity 
was met. In the first place, both sets of 
schools used the same arithmetic texts, fol- 
lowed the same course of study, and allotted 
_the same amount of time to the subject. In 
| the second place, economically and socially 
the communities served by the two sets of 
schools were about equally representative of 
the total population served by the seven 
|schools. One of the "crutch" schools was 
| located in a suburban area slightly below 
| average in economic circumstances and social 
standing. ‘Ihe other of the "crutch" schools 
| was situated in a slightly better-than-aver- 
age city district. ‘the five "non-crutch" 
schools were distributed as follows: one in 
a very poor suburban area, one in an everage 
suburban area, one in a superior suburban 
/area, one in an average city area, and one 
_in a slightly superior city area. These 
facts, which show likeness both in the gen- 
eral features of arithmetic instruction and 
in pupil material, would seem to justify the 
| comparisons which were made between "crutch" 
and "non-crutch" children. 
| The first comparison had to do with the 
number of times faulty procedures were used. 
For this purpose Table V was prepared. It 
is read in the following manner (fourth row): 
two children in schools 3-7 ("non-crutch" 
| schools) exhibited 40 to 43 faults in their 
work as compared with one such child in 
schools 1 and 2 ("crutch" schools). These 
-numbers of chiléren represented respectively 
|1.2 per cent and 1.3 per cent of their 
|groups. A total of 3.6 per cent of the "non- 
_crutch" and of 3.8 per cent of the "crutch" 
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TABLE V 





COMPARTSON OF SCHOOLS 3-7 ("NON-CRUTCH" SCHOOLS) WITH SCHOOLS 1 AND 2 ("CRUTCH" SCHOOLS) 
WITH RESPECT TO FREQUENCY OF USE OF ALL TYPES OF FAULTY PROCEDURES COMBINED 































































children used faulty procedures the same or 
a larger number of times. The data in the 
second pair of columns were utilized in 
Part I to construct Figure 1 in order to 
show graphically the relative distribution 
of pupils in the two groups with respect to 
the scale of frequencies given in the first 
column of ‘'able V. The last two columns re- 
veal the same facts through cumulative per 
cents. It will be noted that 28.9 per cent 
of both groups used faulty procedures 20 

or more times and that approximately 50 per 
cent used faulty procedures twelve or more 
times. On this basis the two groups of chik 
dren are seen to be more alike than differ- 
ent in the degree to which faults appeared 
in their work. 

Even though it did permit a study of 
the two groups point for point on the scale, 
this type of comparison was not satisfactory. 
It entirely neglected the question regard- 
ing the variety of faults to which the 
groups were subject. Accordingly a count 
was made of the number of different faults 
which appeared on the papers of the "non- 
crutch" children and of the "crutch" chil- 
dren. The totals were found to be 49 and 




















T | . 

Frequency of Number of Pupils Per cent of Cumuletive 

Faulty | ____—Whole Group =| _s__—_—sC*Per_Cents 
Procedures Schools | Schools Schools | Schools | Tae 
3-7 1&2 3-7 lé&e 3-7 1&2 
More than 48 g* «6 ge 12 | 2.5 1.2 2.5 
48 0 0 0.0 0.0 1.2 2.5 
44 2 0 | 1.2 0.0 2.4 2.5 
40 2 Pre ts | 1.2 1.3 | 3.6 3.8 
36 . 49 | 3.0 2.5 | 6.6 | 6.3 
22 € | 2 3.6 2.5 10.2 | 8.8 
28 16 os 9.7 6.2 19.9 | 15.2 
24 7 5 4.2 6.2 | 24.1 | 21.4 
20 5 . | 4.8 7.5 28.9 | 28.9 
16 18 6 | 10.9 7.5 | 39.8 | 236.4 

| | 
le 19 11 11.5 13.8 51.3 | 50.2 
8 31 18 18.8 | 22.5 70.1 | 72.7 
4 21 ll 12.8 | 13.8 82.9 86.5 
) 28 11 17.0 | 13.8 | 99.2 100.2 
* actual frequency of faulty procedures, 60 and 65. 
**actual frequency of faulty procedures, 54 and 67. 











34 respectively. The difference of 15 seemed 
to be evidence of considerable unlikeness, 
with the "crutch" children holding distinct 
advantage. Still the difference in number 
of faults might have been due to the larger 
size of the “non-crutch" group or to their 
enrollment in five instead of two schools. 
Checks were accordingly made of the influence 
of these two factors. 

In order to secure a group of 80 "non- 
crutch" children who would be comparable with 
the 80 "crutch" children a sample of 80 was 
drawn from the 165 "non-crutch" children in 
such a way that the proportionate number con- 
tributed by each school was the same as its 
contribution to the total of 165. The num 
ber of different faults in the work of this 
sample of 80 "non-crutch" children was 38, 
Four other similar samples were selected, and 
their variety of faults was studied. ‘the 
range of different types was from 36 to 40, 
giving a median of 37 for the five samples. 
This figure, 37, was a better measure than 
was the 49 originally obtained, of the number 
of different faults to which the "non-crutch" 
children were subject. It differs by only 3 
from the corresponding figure for the "crutch" 





group. 
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Yet small as was this difference, it 
still failed to take account of the possi- 
ble influence from the unlike numbers of 
schools represented in the two groups. In 
one of the two "crutch" schools 30 different 
faults were discovered; in the other, 32. In 
the "non-crutch" schools the following num 
bers were found: 19, 20, 36, 21, and 2. 
RKetually, therefore, three of the five “non- 
crutch" schools used fewer different kinds 
of faulty procedure than did one of the 
"crutch” , and four of the five used 
fewer than did the other one of this pair. 
the larger total of 49, and the smaller one 
of 37 as given above, are thus seen to be a 
thoroughly unsatisfactory measures of the 
relative number of different faults to be 
found in the fraction habits of these "non- 
crutch" chilcren. A valid measure for this 
croup would tend towipe out entirely their 
apparent difference from the "crutch" chil- 
dren and would serve to emphasize likeness 
rather than dissimilarity in respect to the 
variety of their faults. 

The two preceding comparisons were be- 
tween the groups as wholes. In order to ex- 
press relative use of faulty procedures more 
precisely, the study was continued in terms 
of averages. ‘lo secure the first average, 
the individual children in the two groups 
were separately arranged in the order of 
the number of faults on their papers. ‘hen 
for each array the median was determined. 
this proved to be 12.5 in the case of the 
"non-crutch" children and was interpreted to 


scnools 


mean that the average "non-crutch" child used (‘the two comparisons based on averages 


faulty procedures 12.5 times. 
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| vantage, though slight, having changed from 
the "crutch" to the “non-crutch"” children.?} 
‘he second average used in the compari- 
son of the croups was derived from the num 
ber of children who emploved the various 
faulty procedures of their group rether than 
from the number of times these faults were 
exhibited. ‘l'abulations were first made to 
discover the number of "non-crutch" chilcren 
who had employed each of the faulty proce- 
dures which were used by their group. Then 
from this array the median war obtained, it 
being 3.5. ‘the corresponding median for the 
"crutch" group was2.9. In order to make 
the medians comparable, allowance had to be 
,made for the number of children in the two 
‘he medians were divided by 165 for 
the "non-crutch" group and by 80 for the 
"crutch" group. ‘rhe resulting measures, 
2.1 per cent and 3.6 per cent, represent the 
proportion of each group who on the average 


| used the faulty procedures of their group. 
|Wnen the basis of comparison was again 
changed to the 28 types of fault which were 
| common to the two groups, these figures be- 
came 3.2 per cent and 4.8 per cent.2 On both 
/bases the "crutch" children showed to a dic- 


advantage. 


| To summarize, the two comparisons of a 


more general nature, that based on Table | 


| showing number of times faulty procedures 


were used and that which took account of thu 
variety of faults for the two groups, seemed 
_to indicate that the "crutch" had had no 
|harmful or retarding influence on learnings, 
were 


‘he correspond-| opposed to such a conclusion, and gave some 


ing median for the "crutch" children was 11.5.| small amount of evidence in favor of the po- 


It was already known that the two groups used 


sition that the "crutch" actually interfered 


different numbers of particular faulty proce- |with learning. 


dures, and there was the chance that the 
above medians, which included all types of 
fault, might discriminate unfairly against 


one or the other of the groups. Accordingly 
those fe vere eliminated which were pe- 
culiar to either set of children, and medians 
were Com] nn the basis of the 2 faults 
which wer: ‘on to both groups. ‘The new 


median for the "non-crutch" group was 10.5, 


Up to this point the comparisons had 
neglected the nature of the different faulty 
procedures which were characteristic of the 

_"non-crutch" and of the "crutch" children. 

' The two groups could have been very much 
alike with regard to their use of faulty 
procedures in general and yet have been very 

unlike in the particular faults to which 

| they were subject. It was not necessary to 


and for the "crutch" group, 11.5, the ad- __| inelude in the comparisons all 55 types of 


1. In the case of neither of the comparisons was the difference reliable. The ratio of the difference in medians to the 
P.E-qirt, a8 .41 when all faults were included, and was .79 when only the 28 common faults were considered. 


2. The differences of 1.5 per cent were not statistically Significant, the ratios of the difference of the per cents to 





PAB s sire, being .91 and .59, respectively, in the comparisons described. 
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TABLE VI 


COMPARISON OF SCHOOLS 3-7 ("NON-CRUTCH" SCHOOLS) AND SCHOOLS 1 AND 2 
("CRUTCH" SCHOOLS) WITH RESPECT TO THE NUMBER OF USERS 
OF PARTICULAR TYPES OF FAULTY PROCEDURES 





Difference _ 
P.B. pier, 


Number of Users Percentage of Total Difference | P.E. 
Schools Schools Schools Schools in | of r 


3-7 1&2 7 1& 2 | Percentage | Difference* 


1s a | 
1l | 
5 
15 
84 
24 











ell 
-60 
21 
1.69 
-52 
-31 
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*Formula for the Probable Error of the Difference: 


6745 P39;\" , (P22 \" in which "p" represents the per cent of one vari- 

i “Ny “No able, "q" the per cent of the other, and "N" the 
number of cases. The figures "1" and "2" refer 
to the two series of scores. 





fault. On the contrary, 32 could be disre- | (D, G, M, U, and V) is this ratio as large 
garded from the outset because of the in- as 1.99. 

frequency of their appearance in either Next the 23 faults were studied from 
group. ‘Ihe remaining 23 types were first the standpoint of the number of times they 
studied in terms of the number of chilcren were used. ‘The data for this comparison 

who used them. The pertinent data are pre- | are found in Table VII, which is construct- 
sented in ‘'able VI, which is read: Fault A | ed like Table VI except for the basic unit: 
appeared on the papers of 15 of the "non- times used instead of number of users. In 
crutch" children and on the papers of 8 of the case of none of these faults is the ratio 
the "crutch" children. These figures repre- | of difference to P.E.ni-- aS much as 1.00. 
sent 9.1 per cent and 10.0 per cent of their Finally the five faults which of all 23 
respective groups. ‘ihe difference in per had seemed to show the largest amount of dif- 
cent is .9 point and theiP.K.pirr is 8.5. ference in number of users in the two groups 
The ratio of difference to P.E., is .l1l. | were subjected to examination with a view to 
The last column of Table VI is the “{mportant determining whether they were really typical 
one. In the case of only five of the faults | of the group which used them more frequently. 
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TABLE VII 
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Total No. Uses | Mean No. Uses Difference P.E. of Difference 
Type of Schools Schools Schools Schools in Means Difference* P.E. 
vena 3-7 | lee | 2&7 1&2 —_- 
enumsnnes —$ $$ $$$$_____— 
A 20 10 12 213 -01 40 -02 
B 106 71 | ~64 89 225 2.41 ell 
Cc 41 1 25 | -O1 24 -99 -24 
D 85 6 | .se | os | .42 1.23 34 
E 231 199 | 1.40 | 2.49 1.09 2.60 241 
F 69 25 | 42 | 31 .09 








G 1 16 | 01 

I 100 41 -61 

I 51 © 27 | ~32 

J 15 17 .09 

K 208 69 | 1.26 

L 05 | 
| 





M | 16 eee 09 
N | 15 4 09 
0 610 376 3.70 | 4 
P 5 27 203 
Q 40 19 24 
R 377 132 2.29 1 





s 42 13 225 
T 58 25 55 
U eee 15 eevee 
Vv 275 eee 1.67 . 
W 23 32 -14 











———— 











220 
252 
34 
el 
-86 
243 


05 
79 
34 
026 
65 


16 
31 
19 


eee 


40 


1.36 07 





| 

| 

| .09 2.38 04 
02 59 34 
okt 2.91 04 
40 









-09 -78 210 


04 54 08 
1.00 2.72 36 
| 31 | 1.14 27 
02 54 36 
64 2.73 +23 





-09 -88 210 
04 1.68 02 
19 1.11 el 
1.67 3.03 -55 
226 1.62 16 














“Formula for the Probable Error of the Difference: 





Three faults were more popular among the 
"non-crutch" children. 
exhibited by only fifteen of the 165 "non- 


crutch" children, and these fifteen were con- 


fined to three of the five schools in the 
group. Fault M was found in the work of 
only sixteen "non-crutch" children, and 
fifteen of the sixteen were furnished by a 
single school. Faulty procedure V was em 
ployed by twenty "non-crutch" children who 
were drawn from the five schools as follows: 
ten, seven, two, one, none. None of these 
faults was therefore really characteristic 
of "“non-crutch" children in general. Neither 


were Faults G and U characteristic of "crutch" 


children even though they did exhibit them 
more frequently than did the "non-crutch" 
children. Reference to Table VI shows that 
the first of these faulty procedures was 


2 > 2 
P.Be irr, = J P-Bewy * a, 





| 
} 
Fault D was, however, | 








used by only two children, and the second by 
only one child in the "crutch" schools. 

A fourth test was still available had 
it been necessary, to detect difference be- 
tween the groups in the types of fault they 
preferred. This test was one of the func- 
tional relation of the "crutch" to the 
fault. To have met the test a given fault 
would have had to be so closely related to 
the "crutch" that the use of the latter 
would have inévitably involved the user in 
the fault. The failure of any of the faults 
to pass the first three tests, or even two 
of the first three tests, made the fourth 
test superfluous. 

All things considered, therefore, there 
seemed to be little basis for the belief 
that the use of the "crutch" had interfered 
with learning to add proper fractions either 
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py making its users excessively dependent 
upon faulty procedures in general or by lead- 
ing them to adopt one or several of the dif- 
ferent types of these faults. 

(c) Relation between use of the "crutch" 
and accuracy in adding proper fractions. As 
a final check on the influence of the ‘trutcth" 
on learning the "non-crutch" and the "crutch" 
children were compared as to their relative 
success on the fraction test. Scores on 
this test were taken as being at least in- 
dicative of the degree to which the two 
groups of children had developed ability in 
that arithmetical process in connection with 
which the "crutch" had been taught inschools 
1 and 2. As measures of ability to add 
proper fractions the scores on the fraction 
test were not as satisfactory as they would 
have been, had it been possible at the time 
of the testing to have foreseen the use to 
which they were later put. 

The fraction testhad not been especial- 
ly devised for the purpose for which scores 
on the test were now to be utilized. The 
test was short;+ it was not timed; it yield- 
ed scores only in terms of accuracy, and it 
did not contain very many difficult examples. 
Then too the kinds of measures secured were 
distinctly limited. In order to furnish 
adequate data for a final answer to the 
question proposed, measures were required of 
other aspects of the ability to add proper 
fractions than those aspects which were cov- 
ered by the test, such as skill in adding 
such fractions in verbal problems, the ex- 
tent to which skill in adding proper frac- 
tions carried over to the addition of im- 
proper fractions and mixed numbers, etc. 

In spite of these limitations, however, 
there seemed to be some value in studying 
the relative success of these two groups of 
children in taking the fraction test. In 
case that extreme differences were found be- 
tween the groups, the results of the study 
would at least foreshadow, even if they did 
not decisively reveal, the general nature of 
the relationship of the "crutch" to profi- 
ciency in the particular arithmetical skill 
under investigation. Thus, if the "crutch" 
children were found to be vastly superior 
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| to the "non-crutch" children in adding the 
test examples, then it should be safe to con- 
Clude that the "crutch" had worked no great 

| hardships upon the former, such hardships as 

| the opponents of this "crutch" might easily 

| conceive on theoretical grounds. And, on 

| the other hand, if the "crutch" children 

| proved to be greatly inferior to the "non- 

| crutch" children, it would hardly be possi- 

| ble to claim for the “crutch" many of the 

| values which its proponents might be in- 

| clined to advance for it. 

| Table VIII contains a distribution of 

| the scores for the two groups of children 

| when credit was allowed only for completely 
correct answers. Thirteen of the "non- 

pee: children made perfect scores, as 

| did four of the "crutch" children, and so 

|on. The median score for the "non-crutch" 

| group was 7.3; for the "crutch" group, 6.9, 
'The difference of .40 in medians was found 
|to be unreliable. 

The method of scoring was then made 

/more lenient, and credit was allowed for an- 

| swers correct except for failure to reduce, 
Table IX presents the distribution of scores 
on this basis. It is read like Table VIII. 

|The difference in medians was now 1.4 in 

favor of the "non-crutch" group, a differ- 

| ence which met reasonable requirements for 

| statistical reliability. 

| An attempt was next made to discover 


| through an analysis of the relative achieve- 
|ment of the two groups on the three types 
| of example in the test a possible explana- 
tion for the comparative loss of efficiency 


on the part of the "crutch" children. The 
method adopted for this study may be ex- 
plained from Table X. Of the first type of 
example (denominators already similar) there 
'were five. In the "non-crutch" schools 
|119 children computed all five correctly, 

| 25 secured correct answers for four, six 
'for three, none for two, four for one, and 
‘eleven for none. The 165 children in these 
schools secured a total of 717 correct an= 
‘swers out of a possible 825 solutions (165 
times 5), making a per cent of 86.9. The 
corresponding figure for the 80 "crutch" 
|children on these examples was 85.8 per cent. 





1. In spite of this fact the test was surprisingly reliable. 
scores on the seven even-numbered examples .85. 


Scores on the eight odd-numbered examples correlated with 


The reliability of the 15-example test was therefore .91, after the 
coefficient had been stepped up by the Spearman-Brown formule. 


Application of this formula seemed to be justified by 


the fact that the statistical requirements for its use were met as well as by the fact that the content of the test- 


halves was closely homogeneous. 
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TABLE VIII 


COMPARISON OF SCHOOLS 3-7 ("NON-CRUTCH*® SCHOOLS) AND SCHOOLS 1 AND 2 ("CRUTCH SCHOOLS) 
WITH RESPECT TO SCORES ON THE FRACTION TEST; CREDIT ALLOWED ONLY 
FOR COMPLETELY CORRECT ANSWERS 


























Scores in Terms of Number of | Schools 3-7 ("Non-crutch" | Schools 1 & 2 ("Crutch" 
Completely Correct Answers | Schools) Schools) 
Sie 15 . | 13 T 4 
14 16 2 
13 7 1 
12 “ 6 
1l 7 1 
10 9 | 5 
3 8 6 
8 8 4 
7 16 | 10 
6 15 8 
5 10 38 
4 9 4 
| 3 10 | 4 
| 2 13 5 
1 7 6 
) 13 6 
: —— ——_$ —- ——- + - — —-—_—_~+-_- —_- - — -_-— —_— + - SS 
Total Cases 165 80 
Median Score | 7.8 6.9 
Q. Same | 4.0 3.0 
P Boys, ~39 | 42 
Difference in Medians 40 
P Bence, 57 
Difference -70 
PB ase, 
| 
The comparisons afforded by the last line of | In the attempt to discover the reason 
the table reveal the fact that the "“non- | for this special deficiency of the "crutch" 


crutch" children were very slightly superior | children the form was studied in relation to 
to the "crutch" children in the first and | 


_ the processes required to solve the type- 
third types of example (the third type re- two examples. Analysis was here resorted to 
quiring that the common denominator be 


“in the hope of isolating those properties of 
found),! but that they were distinctly su- | the “crutch” which would make its users en- 


perior in the case of the second type (one counter unusual difficulty with examples of 
of the denominators being the common denom- this kind. No such properties could be 
inator). Here the difference of 10.4 per found. A second type of analysis was then 
cent was 1.4 times the P.E.nrp.. On this | undertaken in the effort to determine wheth- 
kind of example in the addition of proper er the "crutch" children tended in type-two 
fractions the “crutch” children were no examples to vary the number, or kinds, or 
match for the "non-crutch" children (that | frequency of their faulty procedures. The 
is, of course, within the limits of the re- search revealed the fact that in this re- 
liability of the observed difference). spect, that of faults, the worl: of the 








1. Difference in type one, 1.1; PEs ns ee » 5.2. 
Difference in type two, 2.3; P.Esns ce , 4.8. 
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TABLE IX 


COMPARISON OF SCHOOLS 3-7 ("NON-CROTCH" SCHOOLS) AND SCHOOLS 1 AND 2 (*CRUTCH" 
SCHOOLS) WITH RESPECT TO SCORES ON THE FRACTION TEST; CREDIT 
ALLOWED FOR UNREDUCED ANSWERS 


William A. Brownell 





Scores: Credit Allowed for | 


Schools 3-7 ("Non-crutch" 


| 
| Schools 1 & 2 

















Unreduced Ansvers Schools) ("Crutch" Schools) 
15 51 26 
14 39 9 
13 22 4 
12 r 10 
11 5 | 6 
10 5 | 5 
9 5 | 3 
8 5 3 
7 3 oe 
. 2 | 1 
5 6 4 
+ 4 | 1 
3 ee } *e 
2 oa 1 
1 ef & 
0 12 5 
ee. eoene oe Se 
Total Cases 165 | 80 
Median Score 14.2 12.8 
Q. Same 2.2 | 2.5 
P.Beya, 22 235 
Difference in Medians 1.4 
P.EBspire. 41 
Difference 3. 41 
P.Bensice, 











"crutch" children was different for these 
examples from what it was for types one and 
three. With the second type of example 
these children not only used a greater num 
ber of different faulty procedures but also 
used them more frequently. At this point, 
however, the inquiry was blocked; an im- 
passe was reached when an explanation (for 
the change in faults) was sought in use of 
the "crutch." It was quite impossible to 
find anything in the nature of the "crutch" 
which could account for the discovered 
facts. 

The conclusions, then, regarding the 
third objection to be raised against teach- 
ing the "crutch" were five in number: first, 
the "crutch" children and "non-crutch" chil- 
dren seemed to be about equally skilled in 
securing completely correct answers to ex- 


amples in the addition of proper fractions; 
second, the “non-crutch" children seemed to 
be somewhat superior to the “crutch" chil- 
dren in this arithmetical process when un- 
reduced answers otherwise correct were given 
full credit; third, the "crutch" children 
experienced their chief difficulty with the 
type-two examples; fourth, the reason for 
this difficulty could not in any way be 
found in the nature of the "crutch" itself; 
fifth, as a matter of fact all the small 
variations noted in the ability of the two 
groups of children to add proper fractions 
might readily be explained as due to instruc- 
tional differences other than those associ- 
ated with whether the "crutch" had or had 
not been taught the children compared. 
Conclusion to Part II. The purpose of 
Part II was stated to be twofold: to present 
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TABLE X 


COMPARISON OF SCHOOLS 3-7 ("NON-CRUTCH" SCHOOLS) AND SCHOOLS 1 AND 2 
("CRUTCH" SCHOOLS) WITH RESPECT TO RELATIVE SUCCESS ON 
THREE TYPES OF EXAMPLES IN THE FRACTION TEST 





= 
Number ts Examples 1-5 Examples 6-9 Examples 10-15 

Examples Schools Schools Schools Schools 8chools Schools 
Correct 3-7 1&2 3-7 1&2 3-7 1&2 








119 
25 
6 


4 
1l 





Cases 


Total Ex- 
amples 
Correct 


Possible 
Number 
Correct 


Per cent 
Correct 























part of the basic data and to explain the Part II are necesscrily the same as 
measures by which those data were interpret- | those for Part I in as much as the two sec- 


ed. Part II has therefore no special con- tions of the report simply represent two 
clusions of its own; the conclusions for different methods of looking at the same 
problem and its solution. 
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"INTELLIGENCE" AND "PERSEVERATION" 


by 
K. H. Rogers, 
University of Toronto 


RELATED TO SCHOOL ACHIEVEMENT? 


This research explores the possibility 
of examining records of school progress in 
such @ way as to show the dependence of 
achievement upon certain measurable "factors," 
Since the development of group tests, many 
investigations have dealt with the relatim- 
ship between intelligence and school prog- 
ress. In the main, these studies have not 
gone very far beyond the statement of re- 
sults, or the postulation of special abili- 
ties, interests and the like. Recently, how- 
ever, Brigham contended that if techniques 
could be devised for isolating some of the 
factors other than intelligence which are 
necessary to explain achievement scores, an 
important field of research would be opened; 
for the understanding of the nature of these 
other factors (called by him "societal fac- 
tors") might contribute to our knowledge of 
important training differences which play a 
part in determining lines of interest and 
success (3). 

In recent years, a number of "factors" 
other than intelligence have been statisti- 
cally demonstrated by Spearman (12), Thur- 
stone (14), and Kelley (6). Most of the 
discussion concerning these "factors", how- 
ever, has been centered upon the validity of 
the techniques involved, or upon the gross 
relationships between the "factors" and oth- 
er major variables. Of these "factors", the 
one symbolized by Spearman as "p" ("persev- 
eration") has received most attention. But 
the examination of "p" has never been car- 
ried in the direction of its possible con- 
nection with the disparity between "g" and 
school achievement. 


| 


| in terms of: 
| importance of 





Accordingly, our problem may be stated 
(1) an attempt to see if the 
"intelligence" as reflected in 
measures of school progress varies in widely 
different educational settings; and (2) in 
the event of the correlation being compara- 
tively low under some conditions, does "p" 
play any part in explaining the fact of this 
low correlation? 

The first study, therefore, dealt with 
the achievement of subnormal patients in ac- 
ademic and manual fields at the Ontario Hos- 
pital, Orillia. By adopting certain of the 
techniques formulated by Spearman (12), an 
attempt was made to clarify the dependence 
of achievement upon general intelligence. In 
the second part of the research, similar re- 
sults for public school children of approxi- 
mately the same mental age were obtained. A 
comparison was thereby made possible between 
the results with subnormal and with normal 
subjects. Since this comparison indicated 
the dependence of public school achievement 
on factors other than "g", particularly on 
"perseveration", the third setting explored 
the relationship between "perseveration" 
scores and school progress. 


II, The Research 
A. Examination of Subnormal Children 
(i) Procedure 

The first experiment was conducted 
at the Ontario Hospital, Orillia. 

Records in all school subjects were 
available for a group of fifty-one cases (21 
girls, 30 boys). The age, mental age, and 
I.Q. ranges of these children are indicated 
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as follows: | It is impossible, from these data, to 
| determine accurately the relationship be- 
tween the "manual" subjects (manual training 





1.Q- 
6l. 


if 


Coke M.A. 


50-70 the other variables. For our immediate pur- 
pose, however, it appears to be sufficient 
to consider the following facts. The mean 
It will be seen that the cases were of the coefficients for the main part of the 
confined to the "moron" group, whose mental- | table (including all cases) is .837. The 
age level corresponds approximately to that | values in the “manual training” column also 
between the senior first and junior third give an average of .837; and for "sewing" 
grades in the Ontario Public Schools. the mean is .838. There is no evidence, 
The subject-scores available included therefore, of any "group factor" in this sit- 
those for arithmetic, reading, spelling, uation. 
composition, writing, social science, phys- The only conclusion possible from the 
ical training for all cases, manual training | above seems to be that the measures of 
for the boys and sewing for the girls. Com- | achievement here employed show a high depend- 
parable measures were obtained in all these | ence upon "g", but do not reflect the influ- 
subjects. ence of any other major factor. In other 
words, not only does this particular insti- 
(11) Results | tutional programme result in a high rela- 


| 
The intercorrelations of these re- (| tionship between "intelligence" and achieve- 


sults are shown in Table I. | ment=<@ fact that appears to be of consider- 
Two things will be noted immediate- | able educational importance--but also any 
ly concerning this table. The first is the difference between achievements in verbal 
uniformly high correlation coefficients. and non-verbal fields fail to make themselves 
The second, is the absence of any signifi- ‘evident. Especially noteworthy is the 
cant tetrad-difference in that part of the latter finding when we consider the fact that 
table involving all cases. The probable er- | "manual" subjects usually correlate to com- 
ror of the tetrad-difference is .028 (8) and | paratively Slight degrees with academic sub- 
the median ,03. | jects in the case of normal children in the 
| public schools. The greater degree of stand 
| ardization in the hospital generally, and 
‘more especially with respect to manual work, 
is probably an important factor in making 
Se SS a a the correlations high. But this only empha- 
. Hee ga ~ 69 = = = 4 sizes again the possible influence of well- 
; spelling oe Ke 20 "al 6S .71 directed education in endeavoring to main- 
Composition .89 .90 - 86 .82 tain a high correlation between "intelli- 
. Writing -85 .81 .89 - gence" and achievement. 
- Social 


aa +82 .86 86 .94 | B, Examination of Public School Children 
Training .71 .70 .82 .80 , 1. Intelligence and Achievement 

. > -69 The next step was to investigate the 

. Manual same relationship with children in the pub- 


Training 86 . ‘ ‘ ° lic scho 
Boys: 30 - 


cases 
. Sewing , : (i) Procedure 


Girls: 30 In order that the mental age 
cases range might be somewhat comparable with that 
Mean r = .82 of the earlier group of subjects, two class- 
Average p.e. of r = .042 es in the junior III grade were chosen. The 
N-51 cases (except where otherwise stated) | pupils of these classes were representative 


7 yrs. 6 mo. 
ll yrs. 5 mo. 


| } 
average | 16 yre. 1 ao.) © yee. 10 we. | for the boys and sewing for the girls), and 
| 
| 
| | 


Range 10 yrs. 1 mo. 
16 yrs. 2 mo. 














TABLE I 
CORRELATIONS BETWEEN ACADEMIC AND "4ANUAL" SUB- 
JECTS 
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of all the junior III pupils in the school. 
This was possible because of the approximate 
equalization of the groups in any one grade 
at Regal Road School, Toronto, resulting 
from the grading policy of the Toronto Re- 
search Division of the Canadian National 
Committee for Mental Hygiene. The pertinent 
data concerning this group are summarized as 
follows: 


Age Range: 9 yrs., 4 mo. to 13 yrs., 3 mo. 
1.Q. Range: 73 to 143. 
No. of Cases: 80. 


The achievement records in the main 
school subjects were obtained from the 
teachers of these classes. Since the re- 
sults of correlating the total school stand- 
ing at different two months! periods indi- 
cated satisfactorily high reliability (.89), 
the existing measures were deemed to be ade- 
quate to our purpose. For correlation pur- 
poses, the marks of the two classrooms were 
distributed separately and converted into 
sigma deviation units. "Intelligence" meas- 
ures were obtained by means of the Pintner 
Group Test. 


(141) Results 
Table II shows the intercorrela- 


tions obtained. it will be noted that the 
correlations are here much lower than was 
the case with the group of subnormals (see 
Table I). They vary from .87 to .07 with an 
average Of .452. With the subnormals, the 
mean of the correlations was .837 with a 
range of .92 to .70. While the school sub- 
jects involved in these tables vary somewhat, 
the difference in the results obtained is 
none the less striking. A similar differance 
has been found by other investigators, and 
will be discussed later. 

A second characteristic of this table 
is that a number of the tetrads indicate the 
presence of group factors. For example, one 
of the tetrad-differences obtained with var- 
iables, F, D, H, E is .262 with the probable 
error of .05. Other examples need not be 
quoted at this point, since the distribution 
of tetrad-differences is given later, 


Rogers 


TABLE II 
TABLE OF INTERCORRELATIONS 





A B 





- Total 
Standing - .87. 

- Compo- 
sition .87 - 
History.83 .64 

- Spell. .69 .65 . 

« at. .78 « 

- Pintner 
Intel- 
ligence 
Test .62. 

- Arith. .€7 
Read. .52 . 
Geog. .61 . 

- Writ- 
ing 40 








2. Factors other than "Intelligence" re- 
flected in achievement scores 
With the public school children, 











_ the low correlations obtained suggest that 
| the standards represented by school marks 


depend not only upon "g" but also upon some 
group factor or factors. Accordingly, con- 
sideration was given to the nature of these 


| latter. 


(1) Procedure 
In addition to the variable al- 
ready discussed (Table II), scores were ob- 
tained by means of the following tests: 
(1) "P" tests.- A battery of 4 tests 
was used, They were simple pencil-and-pa- 


| per tests based on the principle of "persev- 


eration” as investigated by Lankes (7), 
Bernstein (2), Pinard (11) and others, (For 
convenience, in Table III, the scores are 
reversed to give a larger number of positive 
correlations, so that they more truly rep- 
resent "c" scores.) 

(2) "“Alertness."= The tests reported 
elsewhere (8) were administered in order to 
obtain speed records with very easy intel- 
lectual tasks. 

(3) "Speed of Execution."- The number 
of letters, digits, etc., written in short~ 
time periods, under conditions of speed, 
served as the main principle here (8). 
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(4) "Fluency."= This test depended on 
the number of different ideas that could be 
written in five minutes, according to the 
techniques employed by Hargreaves (5). It 
was included as one of the variables in an 
attempt to see how far "fluency" and "per- 
severation" (inertia) were related as psy- 
chological manifestations. 

These test batteries were chosen mainly 
because they represented material with which 
group factors had already been isolated by 
previous investigators. With each of these 
batteries, fairly high reliability (self- 
correlation above .85) was obtained, with 
the exception of the "p" tests, where the 
material was not so easily administered. 

But even though the self-correlation here 


was only .5, the use of the results of these 


tests in the manner indicated below seems to 
be justified. 

The next step was to calculate inter- 
correlations throughout the whole table of 
variables. This table was then considered 
in the light of the tetrad-difference (Two 
Factor) method of Spearman (12). 


(11) Results 
The additional correlations 


(i.e., additional to those given in Table II) | 


are presented in Table III. The distribu- 
tion of tetrad-differences for the correla- 
tions of both tables combined is given in 
Table IV, 


TABLE III 
TABLE OF INTERCORRELATIONS 








K L i N 
A. Total Standing 13 09 -.22 -20 
B. Composition 223 ell -.14 220 
C. History 12 eil 09 10 
D. Spelling -28 el2 -.16 31 
E. Literuature -.05 00 -.04¢ -.04 
F. Intelligence 09 02 -.10 -.01 
G. Arithmetic -.1l OF -.37 08 
H. Reading 13 23 =- 10 232 
I. Geography Ol -.09 04 -.16 
J. Writing 37 31 -05 -33 
K. Fluency - ell 41 223 
L. Alertness ell - 13 017 
M. Speed of Execution .41 13 7 -.25 
N. Perseveration 223 ol? -.25 - 
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plausible reasons for his behavior (1 case); 
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The relations of reading, spelling and 


/writing with "c" are interesting, especially 


when contrasted with the apparent correla- 


| tion of literature and ceography with "per- 


severation."” Further detailed analysis pro- 
vided supplementary and corroborative data 
for these findings. As a contrast to geog- 
raphy, history was similarly analyzed, but 
showed no such clear-cut influence of "per- 
severation." 


TABLE IV 
THE DISTRIBUTION OF TETRAD-DIFFERENCES FOR 
TABLES II AND III 





Tetrad-differences Frequency 
46 to .45 A 
235 - 40 3 
00 - .d5 | 10 
25 - 30 38 
e200 - 25 125 
015 - .20 247 
10 - .15 492 
205 - .10 | 774 

O- .05 1313 
| 3008 





Median tetrad-difference = .O€2 
P.E. of tetrad-difference = +.028 


3. Some qualitative considerations 
A further lead to interpretation 
seems to be offered by studying the rela- 





tionship between factors such as "persever- 
ation" and "case history" material. Of the 
| children used in this inv. tigation, twenty- 


five had been examined by the Research Clin- 
ic in connection with behavior and educa- 
tional problems. Ten of these cases obtained 
"perseveration" scores that placed them 
in the low 25 per cent of the "persevera- 
tion" distribution, five were high “persev- 


_ erators" and ten were within the middle fif- 


ty per cent. The first group were reported 
to the Clinic as being: (a) badly behaved 
in school, restless, inatventive, unrelia- 
ble, careless, given to minor forms of mis- 
demeanour (3 cases); (b) involved in sex 
delinquencies in situations where the major 
responsibility was deemed to rest on other 
children present (2 cases); (c) shy, sensi- 
tive, nervous (1 case); (d) bright, attrac- 
tive, a trifle spoiled, restless (1 case); 
(e) sensitive, talkative, truant, ready with 


geptember, 1933 K. H. Rogers 
(¢) inattentive, weak in geography, careless 
(1 case). The five high "perseverators" 
were described as deceitful, lacking in en- 
thusiasm, unwilling to do particular work Sr. 38 1 
(1 case); very weak in reading and spelling Jr- 56 1 
(2 cases); the "worst" child in the room Ply 26 

(1 case); a sex delinquent, --the "worst" = te 


TABLE V 
THE SUBJECTS ACCORDING TO GRADES 


No. of Cases M.A. Range 


12.9 - 16.7 
11.6 - 15.5 
13.7 10.1 - 14.1 
13.4 10.9 - 13.8 
13.7 -10- 13.7 





Grade C.A. Range 





16.6 
14.8 


boy in the neighborhood (1 case). In con- | $F. 24 
trast, the ten children of the normal "per- 
severation" group were reported as being 

4ull (2 cases); for routine examination in 
connection with the history of a brother or 
sister (3 cases); health problems (2 cases) 


the "best" child in the class (2 cases); and | 


only one as a "nervous" case. While no con- 


clusions can be drawn from such scanty data, 


it is interesting to note the harmony be- 

tween the picture here represented and the 
findings of Pinard (11); for this investiga- 
tor found that "difficult" cases tended to 
show high or low "perseveration", the high 


their delinquency. 


Much elaboration of this pattern of re- 


search is necessary before we can interpret 
any of the factors further. The number of 
cases is small; and observations in differ- 
ent educational and social settings would 
have to be made before any definite inter- 
pretation could be offered. 


C. Relationship between School Achievement 
and "Perseveration" 

The data presented in the last sec- 
tion indicated that of the factors other 
than "g" reflected in school marks, "persev- 
eration" seemed to be the most significant. 
A further study was therefore made with a 
view to seeing if the relationship between 
"perseveration" and achievement could be 
more definitely determined. 








(1) Procedure 
For this purpose another school was 
chosen, These children were first examined 
by the Pintner Intelligence Test. The age, 
mental age ranges, and number of cases in 
each grade are shown in Table V. 

Then "perseveration" scores were secur- 
ed, using tests similar to those employed 
above, These were administered at approxi- 
mately the same time as were the "intelli- 
gence" tests. The school standing was de- 


24 
26 
20 


11.6 . = 11.9 
11.3 -4# - 11.10 
11.6 - 11.9 


aorowowowon 
. . . . . . 
CDMONNYH 3 


Total 


@ 
. 
oO 


16.6 16.7 














*i.e., 12 years, 7 months. 


termined by the marks given by the teachers 
on the examinations of the month immediate- 


| ly preceding. 


The reliability of the "p" tests is to 
some extent indicated by the correlations 


| given in Table VI. 
"perseverators” being the more persistent in 


TABLE VI 
INTERCORKELATIONS OF "p" TESTS 





| (1) | (2) 


| 


«26 (.04) 


| lee 28 











220 
27 


Number of cases: 
Average correlation: 
Average probable error: .04l 


By taking the first test against the second 
and third, a correletion of .45 was obtained, 
Comparing these values with similar ones re- 
ported in the literature (Pinard, Jones, etc,), 
they suggest that results obtained by a 
group examination show sufficient reliability 
to justify their use in the way here em 
ployed, 


(11) Results 
The "perseveration" scores were 
compared with age, mental age, and I.Q. The 
tables of results showed no significant trend 
of variation for "perseveration" as here 
measured in relation to any of these three 
variables, 

An interesting reletionship appeared, 
however, when the disparities between “in- 
telligence" and achievement of the high 
"perseveratives", who were below the average 
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of the cless in “intelligence”, were compare? 
vith the low "perseveratives", who were 
similarly below average in "intelligence." 
The term "disparities", as here used, can 
best be understood by referring to the way 
in which they were determined. For eac! 
classroom, the “intelligence” test scores 
were distributed separately and the point 
scores were translated into deviation (sig- 
ma) units. The same procedure was adopted 
in connection with the achievement marks. 
The difference between the respective devie- 
tion scores for any one individual constitu- 
ted his measure of disparity. Where the 
achievement rating was higher than that for 
"intelligence", the disparity was called 
"positive" (+); in other cases it was called 
negative (-). "High" and "low persevera- 
tives" refer to those cases who deviated 
more than one sigma in the "perseveration" 
distribution, that is, the positive and neg- 
ative extremes, 

Respecting the above-mentioned rela- 
tionship, it was found that whenever “intel- 
ligence” is below the average, the tendency 
is for disparities to be positive irrespec- 
tive of the "perseveration" ratine. 
sults for high and low "perseveratives" with 
more than average "intelligence" were dis- 
tributed, and in contrast with the foregoing, 
it appears that there is a preponderance of 
negative disparities among the children who 
score above the average in "intellicence", a 
total of 13 negative and 3 positive for the 
high "“perseveratives", and 15 necative and 
6 positive for the low "perseveratives." 

When the medium positive "persevera- 
tives" are combined with the high “persever- 
atives", and the medium low "perseveratives” 
with the low "perseveratives", the reneral 
indication is that the inclusion of cases 
who do not deviate markedly from the mean 
in "perseveration" reduces the tendency 
above mentioned, without, however, causing 
it to disappear. In other words, while it 
seems characteristic of children who 
low the average of their class in "intelli- 
gence" to manifest positive disparity (and 
contrariwise for those above the average in 
"intelligence"), this tendency is accentu- 
ated with both the high and low "persevera- 
tives,” 
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In order to obtain corroborative data 
on this point, the disparities of the high 
and low “perseveratives” throughout the 
school were compared with those pertaining to 
the medium croup. In the first place, the 
children in each classroom were divided into 

oups (marked off by the quar- 
tiles) on the basis of their "perseveration" 
scores, and then further classified in ac- 
cordance with their “intelligence” ratings as 
to whether they were above or below the av- 
erage. Again, as above, there was found a 
difference with respect to "perseveration." 
The tendency for the children of more than 
average "intelligence" to show negative dis- 
parity and for those of less than average 
"intellirence" to show positive disparity is 
much greater with the extremes of the "per- 
severation" distribution than with the oth- 
ers, 


four equel gro 


Further corroborative evidence was dem- 


| onstrated when the average amount of dispar- 
| ity displayed by these various "“persevera- 


tion" croups was determined, Table VII shows 
that in the fourth crade the high "“persever- 


| atives" who are above the averace in "intel- 


ligence” show an average negative disparity 
of .80; while the high "perseveratives” who 


TABLE VII 
THE AVERAGE AMOUNT OF DISPARITY 
(ACHIEVEMENT FROM "g"™) DISPLAYED BY THE 





"PeHKSEVERATION" GROUPS 
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| High x bl ip Low =p" as Midale 50 Re 
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4th; -.80 81 | -.96 | 08 | -.28 -60 
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ord; -.81 e900 | -.44 61 ~.06 | +54 
| | | 

2nd/-1.18 12.28 | -.02 o -.52¢ | 48 
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are be= 


are below the average "intellicence" show an 
| average positive disparity of .81. 


The oth- 


er values in the table, read similarly, again 


draw attention to the fact that the dispar- 
ity values tend to be higher with the high 


| and low “perseveratives" than with the mide 


die 50 per cent. This is especially true in 
connection with the hich "perseveratives" of 
the second grade and with the low "persever- 
atives" who are below the average "intelli- 


| gence" in this grade, 


September, 1933 


In order to make sure of the fact that | 
these figures are not seriously influenced 
hy the inclusion in the medium "persevera- 
-ion" group of both the medium positive and 
medium negative cases, it was decided to ob- 
tain corroborative correlational results. If | 
the above contention is true, those cases | 
who are within this middle 50 per cent group 
of the “perseveration” distribution should 
show greater correlation between "intelli- 
gence" and achievement than is the case with 
extremes (high and low "perseveratives"). 
The actual values obtained are .42 and .25 
respectively. In other words, while there 
is a tendency for all children who are below 
the average “intelligence” of their class to 
show positive disparity, and for those who 
are above the average "intelligence" to show 
negative disparity, this tendency is enhanced) 
in the case of both the high and low "pa 
severatives.” 


K. H. Rogers 





III. Summary and Discussion of Results 

1. The first study indicated the rela- 
tively high importance of "intelligence" in 
the academic and "manual" achievements of 
subnormal children. There seemed to be no 
definite manifestation of "group factors" in 
this setting. 

2. With a fairly comparable group of 
public school children, the relationship be- 
tween “intelligence” and achievement in 
school subjects seemed to be much lower than 
in the former case, and "factors" other than 
"intelligence" appeared to underly some of 
the varietions in achievement in different 
subjects. "Perseveration", as measured by 
the tests employed, seemed to be particular- 
ly important. Some relationship between 
this factor and behavior tendencies of the 
children is suggested. 

3. The third study investigated this 
factor "perseveration" more extensively by 
examining 220 children of the public school. 
There seemed to be no reletionship between 
"perseveration" and age, mental age, and 
I.Q. However, a relationship appeared to be 
indicated when "perseveration" scores were 
compared with "disparities" in achievement 
with respect to "intelligence." While there 
was a tendency, common to all grades inves- 














tigated, for the children of more than aver- 
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age “intelligence” for their grade to show 
negative “disparity”, and for those with low- 


er "intelligence" to show positive “dispar- 


ity", this tendency was increased in the 
case of the upper and lower 25 per cent of 
the "perseveration" distribution, 

The difference between the achievements 
of subnormal and normal children with re- 
spect to their dependence on "intelligence" 
has been noted previously. For example, 
Abelson (1) applied the same tests to both 
classes, and for normal children obtained a 
mean correlation of .466, while for the de- 
fective children the value was .782. Spear- 
man notes a similar condition present in com 
paring young and older children (12, p. 218), 
and in comparing children with adults (12, 
p. 219). Spearman's interpretation of this 
tendency is well known, (12, p. 220). He 
uses an analogy, the law of "diminishing re- 
turns", and states that the influence of any 
general factor in differentiating the achieve- 
ment of different individuals grows less as 
the factor itself increases. Here, while 


this fact may be adequate to explain our re- 
sults in part, we should note, among other 
possible interpretations, the following: 

(a) the range of talent is somewhat greater 


with the mental deficients here represented, 
than with the public school group employed 
in the second part of the study. This and 
other factors may therefore have influenced 
the reliability of the measures concerned, 
in favor of those pertaining to the hospital 
situation. Information available for our 
data, however, does not suggest that this is 
more than a minor factor in the situation. 
(b) The hospital setting may have contributed 
a "halo" effect, in that the pupils are 
being marked by teachers who associate with 
them continuously. But this is even less 
true of the hospital group than in the pub- 
lic school, since inthe latter one teacher 
only gave the scores that were used, while 
in the former several teachers were involved, 
Further, the marks available with the sub- 
normals were in many cases obtained from 
"objective" tests constructed by the princi- 
pal on the basis of the curricular demands, 
On the other hand, the careful standardiza- 
tion of the curriculum at Orillia, and the 
uniformity of the institutional background, 
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may contribute considerably to this phenom 
enon. 

Concerning the part played by "persev- 
eration” in school achievement, any inter- 


| 


| 
| 
| 
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(4) Psychological factors other than "¢" 
are definitely reflected in the subject- 


| achievements of normal school children, 


pretation at this stage must of necessity be 


very tentative. But the writer draws atten- 
tion to its possible relationship to school 
procedures as suggested by the results of 
the second study. The subjects favored by 
the "perseverative tendency" (notably, lit- 
erature and geography) would appear to give 
more scope for a contemplative type of think- 
ing, and this probably offers a lead to the 
interpretation of "perseveration." If this 
is the case, an interesting point arises re- 
specting the results obtained with history, 
where "perseveration" seemed to be of no 
significance, which suggests the possibility 
that a greater emphasis is put on isolated 
facts in this subject. 

Whatever the interpretation of "persev- 
eration” may be, the origin of this factor 
appears to be traceable at least to the ear- 
ly years, since it showed no significant 
increase with chronological age in our study. 
Since the incidence of high or low "persev- 
eration" has been found by Pinard to be un- 
usually high with problem cases--a fact that 
is to some minor extent corroborated by our 
results--the nature of early home training 
may have a great deal to do with the pres- 
ence of this factor. Further, when this 
‘factor is present in either extreme, it would 
appear to have some influence on the adjuste 
ment of the child to the demands of the 
classroom, as the results of disparity cases 
tend to indicate. 


IV. Conclusions 

(1) When the achievements of subnormal 
children are compared with those of normal 
children in the fields represented by 
school subjects, the former tend to show a 
much higher relationship with "intelligence" 
than do the latter. 

(2) The fact stated in conclusion (1) 
would seem to indicate a much more carefully 
directed educational situation in the case 


of the subnormals in this particular setting. | 


(3) It appears to be difficult to iso- 
late and interpret the significance of meas- 
urable factors other than "g" with subnormal 
children in a setting such as the one chosen 





(5) Of the factors referred to in con- 
clusion (4), "perseveration", at the present 


' time, appears to warrant the most emphasis 


for further study. 

(6) Within the age-range of the children 
studied (7 to 16 years), no evidence of in- 
creasing "perseveration" with age or with 
mental age was demonstrated. 

(7) "Perseveration" appears to be relat- 
ed to the tendency herein shown, for chil- 
dren of more than average "intelligence" to 
display negative disparity between "intelli- 
gence” and school achievement; and with the 
contrary tendency, for those of less than 
average "intelligence" to show positive dis- 
parity. High and low "perseverators"  dis- 


| Play these tendencies to a greater degree 
| than do those children who are within the 


| 
| 
| 
| 


| 
| 





medium ranges of the " perseveration" dis- 


tribution. 
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THE EFFECT OF PRAISE, REPROOF AND EXERCISE 


UPON 


MUSCULAR STEADINESS 


by 


Merrill T. Eaton 
Indiana University 


The use of praise or reproof to secure 
increased efficiency in individuals is per- 
haps more frequently resorted to than any of 
the other more common incentives. In educa- 
tion, in business, in athletics, in the 
home, and elsewhere the use of praise and 
reproof is an everyday occurrence. A clear- 
cut and satisfactory explanation of just 
what effect is produced in the individual 
when he is praised or reproved has, however, 
never been given. A survey of the investi- 
gations dealing with this problem, as weil 
as the opinions of various writers on the 
subject, would lead one to believe that 
praise or reproof es an incentive affects 
the behavior of the individual through cer- 
tain changes produced in his emotional 
state, 

An attempt at an analysis of this prob- 
lem leads one into many diverse fields and 
through many different types of investiga- 
tion. A review of some of the typical stud- 
ies in these fields shows the complexity of 
the problem, Gilchrist (7) investigeted the 
extent to which praise or reproof affects a 
student's work. In this study fifty stu- 
dents enrolled in a class in educational 
psychology, after they had taken an English 
test, were divided into two groups without 
any consideration of the scores they had 
made. ‘The members of Group I were told 
they had done very poorly on the test, and 
the members of Group II were informed that 
they had done exceptionally well. Both 
groups were then requested to take the test 
again. The group that was praised improved 
its score 79 per cent; the score of the re- 
proved group was lowered 6 per cent. The 
investigator attributed these results to 
the attitude set up by his words of praise 
and reproof. Hurlock (10), testing the ef- 
fect of praise and reproof, used school 


| children as subjects. 
|were given the children without the use of 
| any specific incentives, and on the basis 
|of their scores the subjects were divided 


| proved were approximately equal. 
_ control group, 52 per cent of the children 





Intelligence tests 


into three groups having identical averages 


on the intelligence test. ‘The three groups 
| were then required to take the test again. 

| For one group nothing was said about its 

| performance on the first test; another 

| group was praised for its high score’ and 
| neat work; the third group was reproved by 


being informed that its work was poor and 


| was told that it should be ashamed to make 


such a low score. All three groups made 
gains when the test was repeated. The gains 
for the groups wnich were praised and re- 
In the 


improved; in the praised group, 79 per cent; 
and in the reproved group, 80 per cent. Re- 
sults secured by the same investigator (9) 
in testing 106 school children from the 
fourth and sixth grades showed that praise 
was & valuable incentive for setting up a 
motivating condition within the learners 
which enabled them to do more and better 
work. Somewhat similar studies by Kirby 
(12), Gates (6), and Chapman (4) have con- 
firmed these results. 

Book (3), in commenting on the effect 
produced by praise or reproof upon a pupil's 
reactions and ability to learn, says: "We 
may conclude that praise by a teacher sets 





| up a favorable motivating condition in the 


learner that enables him to do more and 
better work." Bird (2) contends that: "As 
a form of social recognition, praise acts 
upon its recipient to arouse attitudes 
favorable to the project in hand; conse- 
quently, it improves the learning process." 


| Kitson (13), investigating the work of forty 
| Journeymen printers, found that when a 
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bonus was given for work completed in ex- 
cess of the average amount turned out 
daily, the average daily output for the 
group was improved 78 per cent. 

By far the most extensive studies deal- 
ing with the comparative merit of learning 
by reward and learning by punishment have 
been made at the Institute of Educational 
research of Teachers College, Columbie Uni- 
versity, under the direction of Thorndike 
(18). In interpreting these investigations, 
Thorndike says: "Other things being equal, 
an announcement of 'right' strengthens the 
connection which it follows and belongs to 
much more than the announcement of '‘'wrong' 
weakens the one which it follows and be- 
longs to." Punishment, however, he believes 
is not always futile, but the results of 
these studies should increase confidence in 
positive rather than negative learning and 
teaching. 

Griffith (8), in discussing the value 
of praise and reproof as incentives in ath- 
letics, states that he considers praise a 
more dangerous weapon than reproof, thereby 
indicating that he believes the greater 
emotional stimuletion resulting from praise 
is detrimental to the best performance of 
the athlete. Berry (1) says: "Ihe fact 
that athletics possess such a high emotion- 
al content is one of the greatest problems 
in coaching." Because of the high emotion- 
al setting in which athletes perform, care 
should be practiced in the use of praise 
and reproof. He states further: "As yet no 
one has learned the technique of this (emo- 
tional) phase of coaching." 

The investigations reviewed and the 
points of view enumerated above would indi- 
cate that really to measure the effect of 
praise and reproof some measures of the dif- 
ferences in emotional status of the individu 
al should be attempted. This problem has 
been attacked from many different angles, 
using various testing techniques. ‘Ihe gal- 
vanometer is an instrument used in one of 
the more complicated techniques. Symonds 
(17), however, considers it "a well-estab- 
lished method for detecting the presence of 
emotion.” Landis and De Wick (14) have made 
a critical review and summary of the litera- 
ture bearing on the psychogalvanic reflex. 
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None of the investigations mentioned 

| above, however, is closely related either 

| in purpose or technique to the present study, 

| but all illustrate the many diverse points 

of attack. Skaggs! (16) study was somewhat 

| similar in technique to the present investi- 

| gation in that he used a Whipple Steadiness 

| Tester to measure involuntary movements of 
the hand and arm. His stimulus situations, 
however, were of the "startle" type, and 

| praise and reproof were not involved. Kel- 

| logg (11) found that both the "exercise" and 

| the "shock stimuli" which he employed af- 

| fected muscular steadiness, causing the sub- 
| Jects to register more contacts per test. 

| He also found that the number of contacts 

| Was increased more after moderate physical 

| exercise than by the emotional situations 
which he produced by the “shock stimulus." 


PROBLEM 


As a review of the studies in the field 
| clearly shows considerable uncertainty exists 
| in regard to the effects which praise, re- 
| proof, or exercise actually produces upon 
| the performance of the individual. Many of 
| the testing techniques previously used have 
| necessitated the use of very complex and 
| elaborate apparatus or testing conditions 
/not fully controlled. ‘This investigation, 
| therefore, was undertaken to measure by a 
| very simple apparatus the changes in mscular 

steadiness resulting from praise, reproof, 
| or exercise. The experimental conditions 
|were arranged so that all testing was done 
/in the normal, natural learning environment 
|of the subjects, and a testing procedure was 
_followed which would tend to show definitely 
the effects of the various stimulus situa- 
tions that were arranged in the present ex- 
periment. ‘ihe chief aim of this study was 
to determine whether praise, reproof, or ex- 
ercise affected the muscular steadiness of 
men and women, and if so, how and to what 
extent. 


EXPERIMENTAL CONDITIONS 


The apparatus selected to 


Apparatus. 
measure steadiness was a plate-and-stylus 





tester of the Whipple type. The standard 
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apparatus, as supplied by Stoelting, was 
modified somewhat to adapt it better to 
the experimental conditions under which the 
tests were given. 
previously made by Kellogg (11) were as 
follows: "(1) A sliding metal cover was 
built in front of the contact plate to per=- 
mit the exposure of only one hold at a time. 
"his precluded the accidental insertion of 
the stylus into the wrong hole. (2) A 6- 
volt electric. lamp was connected through a 
relay in the plate-stylus so that it flashed 
whenever contacts were made. This light, 
mounted almost directly in S's line of vi- 
sion upon the upper edge of the contact 
plate, informed him when the stylus touched 
it and prevented him from falling uninten- 
tionally into the error of resting the sty- 
lus upon the circumference of the hole and 
so obtaining a spuriously high steadiness 


score"; (3) ‘the contact plate was mounted on 


a tripod, the tripod resting on a small 
table about thirty inches high. With this 
arrangement the contact plate could be con- 
veniently raised or lowered depending upon 
the height of the student. Since this ex- 
periment was conducted under absolutely nor- 
mal or natural learning conditions, the ap- 
paratus was located, in case of the men, in 
the men's gymnasium just outside the bound- 
ary line of the basketball court, opposite 
the center circle. In case of the women 
testing was done in the physical education 
building for women, the apparatus being lo- 
cated approximately opposite the center of 
the pool and as near the edge of the pool 
as was possible without water being splashed 
on the equipment. (4) All steadiness rec- 
ords were made with the subject standing. 
The contact plate was adjusted to the shoul- 
der height of each subject who stood at a 
distance from the apparatus equivalent to 
his arm's length. 

Subjects and Experimental Procedure. 
Forty-five persons, twenty-three men and 
twenty-two women, wereused as subjects. All 
the men were freshmen students at Indiana 
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|was undertaken or during the course of the 


ing period being near the close of the bas- 
| do their best, since those ranking highest 


| awarded "numerals." 
| these men subjects, the experimenter feels 
| sure that they took the experiment seriously 
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class; six were freshmen, eleven were sopho- 
mores, and five were juniors. The women 
subjects were tested during the month of May 
(1933) after they had been participating in 
swimming twice weekly since the beginning of 
the second semester, February 1, 1933. 

None of the subjects had any knowledge 
of the purpose of the experiment before it 


experiment. ‘!hey were informed that the pur- 
pose of the tests was to measure steadiness 
under various learning conditions. The test- 


ketball season, the men were motivated to 
| among the freshmen candidates were 


to be 
From remarks made by 


because they believed that the results of 
| the study would be considered in making the 
"numeral" awards. The experimenter was care- 
‘ful neither to deny nor to confirm this be- 
| lief. 
The women subjects were marked on the 
/amount of improvement they made in swimming 
‘during the semester, and since they also 
| thought the results of the experiment would 
| be considered in making up their final marks, 
the experimenter feels confident that they 
| too were motivated to do their best. 
| Each subject was given practice trials 
/until he was thoroughly familiar with the 
operation of the equipment. No attempt was 
| made to screen from the subjects the electric 
| counter for recording the contacts made, and 
all the men and women asked each time for 
their "score" (i.e., the number of contacts) 
made during the test period. In fact, they 
seemed to take special pleasure in knowing : 
that they were able to make a good steadiness 
"score." 

Preceding the test period, neither the 
experimenter nor his assistants! talked to 
the subjects. They came directly from the 
basketball floor or the swimming pool and 
were tested immediately. Following the 


University and were candidates for the frest-| test, however, the experimenter freely dis- 


men basketball squad. ‘the testing was done 
during the month of March (1933) after the 
men acting as subjects had been participat- 
ing in basketball practice all season. ‘rhe 
women subjects were members of a swimming 





cussed the "score" with the subject. Their 
introspections and comments were, however, 
not recorded, a fact that probably consti- 
tutes a weakness in the testing technique. 








1. Walter Marks, freshman basketball coach, and Miss Erna Van Valer assisted in the testing. 
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Preliminary tests on several basketball 
candidates and swimming class students whose 
results are not included in the present 
study indicated that a test period of ten 
seconds would be the most desirable. Kel- 
logg (11), using the same apparatus and a 
similar testing technique, found that if 
the test period did not exceed twelve sec- 
onds, with a short rest period intervening 
petween tests, fatigue could be almost en- _ 
tirely eliminated. 

The procedure followed in making the 
test was for the experimenter to adjust the 
contact plate to the shoulder height of the 
subject, hand the subject the stylus, have 
him place it in the hole in the contact- 
plate (the stylus was held in an horizontal 
plane and perpendicular to the face of the 
metal sheet) and then on the command "ready", 
the operator pressed the key that connected 
the stylus and contact plate with the elec- 
tric counter.which recorded the number of 
contacts and simultaneously started the 
stop watch. ‘When the ten seconds were up, 
the operator released the key, which broke 
the connection, thus avoiding any extra con- 
tacts after the ten second test period. By 
following this procedure, only the net num- 
ber of contacts in the specified ten second 
testing period was reccrded. 

The subjects were tested in six differ 
ent types of stimulus situations. The test 
"series" for the men consisted of ten tests 
for a norm. Five tests in all other situa- 
tions were given, from which the average 
score for each of the five types of stim- 
lus-situations was computed. In the case 
of the women, five tests constituted a se- 
ries for the norm and each of other five 
"series" of tests consisted of three tests. 
Not more than one test was given in any test 
series on the same day.t The same subjects, 
however, might be tested twice on the same 
day; i.6., one test in the norm series and 
one test in another of the test series, 


STIMULUS SITUATIONS 
The stimulus situations that could be, 


used in an experiment of this nature were 
determined through conference with the fresh- 
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| man basketball coach and the swimming in- 


| structor. 
| 


From this list of possible stim- 
ulus situations, the choice was narrowed 

| down to six. The six selected were of such 
| a nature that no artificial condition need 

| be established. Neither would the actual 
testing interfere with the work of the coach 
or swimming instructor. The situations se- 
lected are the following: 

1. Norm stimulus. The men subjects re- 
ported at the gymnasium at 2:30 P.M. dressed 
for basketball practice and then reported to 
the experimenter. They were asked to sit 
and rest for five minutes to offset any fa- 
tigue or emotional disturbance that walking 
to the gym and dressing might have aroused. 
They were then given a ten second period 
test. <A similar procedure was followed for 
the women, except that they reported at the 
physical education building for women at 
10:00 A.M. dressed for swimming. 

2. First exercise stimulus. The basket- 
ball coach's practice program consisted of 
having the men shoot baskets for the first 
thirty minutes. An assistant kept an ac- 
curate record of the time the subjects be- 
gan practicing, and after twenty minutes of 
basket shooting, the subject was called to 
take a test. In the case of the women, the 
assistant recorded the time the subject en- 
tered the water and after practicing a 
simple, overhand stroke with flutter kick 
for ten minutes, she was called to take a 
test. 

A difference of practice time between 
the men basketball subjects and the women 
swimming subjects necessitated a difference 
in the intervening periods between the tests 
for the different stimulus situations. 

The practice period for the men sub- 
jects was two and one-half hours and for the 
women forty minutes. The intervening time, 
therefore, between stimulus situations was 
considerably shorter for the women. The 
difference in time of the periods, however, 
is probably justified by the fact that the 
men had been participating in basketball for 
a considerably longer period of time and 
were, therefore, in much better physical 
condition than the women for the type of ex- 
ercise engaged in before the tests were given, 














1. The testing for women was not begun until late in the second semester and for that reason the number of readings in 
each test series was reduced so that there would be sufficient time to complete the testing before the subjects began 


preparing for final examinations. 
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3. Second exercise stimulus situation. 
The third stimulus situation was similar to 
the first, except that the practice period 
was longer. For the men the second exer- 
cise stimlus consisted of 65 minutes of 
practice, including basket shooting, funda- 
mentals, plays, and fifteen minutes of 
scrimmage; the fifteen minutes of scrimmage, 
which was closely observed by the coach, 
immediately preceded the testing. In case 
of the women, the second exercise period 
consisted of twenty minutes of practice on 
a simple overhand stroke. Five minutes of 
this practice that just preceded the test 
consisted of trial strokes under the obser- 
vation of the instructor. 

4. Praise stimlus situation. The con- 
ditions for the stimulus situation where the 
praise stimulus was givenwere exactly the 
game as those of the second exercise stimu- 
lus, except that in case of the men the 
coach called the subject to the side of the 
court, put his arm on his shoulder and 
there, where no one else could hear, praised 
the man for some phase of his playing. No 
subject, however, was praised unless he 
really deserved it, and the praise was not 
of the "flowery" type, but that generally 
used by the coach. The commending state- 
ments generally used were: "That perfectly 
timed pivot helped to pave the way for that 
basket. It was well done.” 

Not more than thirty seconds intervened 
between the time the subject was called from 
scrimmage and his reporting to the experi- 
menter to be tested. Incase of the women 
the instructor called the subject from the 
pool and praised her for the improvement she 
was making. The sudject was not praised un- 
less she deserved praise, and the commending 
statements were of the type the instructor 
generally used, such as: "Your coordination 
was excellent. You are improving very much." 

5. The Reproof stimulus. The condi- 
tions for the reproof stimlus situation 
were exactly the same as those for the 
praise stimulus situation and the testing 
procedure was identical, except that for 
this stimulus situation the subjects were 
reproved, They were not criticized, how- 
ever, unless they deserved it. The reproof 
used by the basketball coach was, however, 
not of the rough type, but consisted of con- 
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structive criticism; e.g., "If you had fol- 
lowed the path of your pass, you would have 
been in a good position for a follow-up shot, 
This is an essential point that you must re- 
member.” The swimming instructor also used 
constructive criticism suchas: "That was 
incorrect. The timing of your stroke shoulq 
be in better rhythm." 

6. Final exercise stimulus situation, 
The last test for the men was given at the 
end of two and one-half hours practice; for 
the women after forty minutes of swimming. 
No emotional situations were involved in 
this last test except those which might na- 
turally arise from the type of learning and 
exercise involved. ‘The subjects were neith- 
er praised nor reproved on the days that 
| tests were made for this stimulus situation. 








| RESULTS 


The objective data, average number of 
contacts per test series, constitute the 
criteria by which the changes in emotional 
status of the learner following praise, re- 
proof, and exercise are determined. Intro- 
spective data would probably have added to 
the validity of the conclusions, but such 
data were not recorded because an attempt 
to secure from the subjects following each 
test a statement concerning their "feelings" 
would have informed the subjects of the pur- 
pose of the investigation, thereby changing 
the testing environment from a normal, nat- 
ural learning situation to an artificial 
one. 

The procedure used, 1.6., individually 
praising or reproving the subjects should 
also add to the reliability of the data. 
Sims (15) found that individual motivation 
was considerably more effective for produc- 
ing changes in the emotional state of the 
learner than was group motivation. Add to 
this the fact that both the praise and the 
reproof bestowed on the subjects during the 
investigation were of exactly the same na- 
ture as that habitually used by the coach 
and the swimming instructor, and one may ex- 
pect that any changes in the average number 
of contacts for the different test series, 
following the stimuli employed, would actu- 
ally measure a difference in the emotional 
status of the learner. ‘Ihe writer feels 
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TABLE I 


AVERAGE NUM3ER OF CONTACTS PER TEN SECOND PERIOD FOR EACH STIMULUS SITUATION 





Subjects 
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*Numbers refer to the six types of stimulus situations. 
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justified in making this statement because Arranging the individual scores from 
he is measuring changes in the physiological | highest to lowest and plotting the result- 
condition of the individual. ing curves (Figure 1) there results a wide 
Individual differences. A study of the | range in the number of contacts made by the 
average number of contacts made by individu- | subjects for the different stimulus situa- 
al subjects for the six different stimulus tions. The number of contacts made by the 
situations (Table I) shows that there were | men for the norm ranged from 12 to 35, a 
marked individual differences in steadiness. | difference of 23 contacts between the most 
In fact, individual differences appeared in | steady and the least steady individual; the 
the first testing series (norm stimulus) | range was almost as great for the women 
and were apparent throughout the subsequent | (9-28), a difference of 19 contacts between 
testing periods. Other investigators in the highest and the lowest, 
this field have found similar differences, For Stimulus 3, the number of contacts 
indicating that normally some learners are made by the men varied from a low of 16 toa 
more steady than others and that different | high of 39, again showing a difference of 
stimulus situations affect individuals in | 23 contacts between the most steady and the 
widely varying degrees. The present data | least steady. The most steady woman subject 
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also indicate that there are important sex for Stimulus 3 registered 11 contacts as com- 
differences in the emotional changes result- | pared with 41 for the least steady individual, 
ing from the same stimuli. making a difference of 30 contacts between 
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the highest and the lowest. If one examines 
the steadiness curves (Figure 1) for the re- 
maining three stimulus situations, it is not- 
ed that the conditions described above are, 
in general, duplicated. 

A study of the individual record of 
each subject (Table I) comparing the number 
of contacts made for different stimuli shows 
the varying effect of the stimuli on the 
emotional status of the subjects. Three sub- 
jects, however, have been selected from each 
sex group as representative of the entire 
group as a means of making a more critical 
study of individual differences in steadi- 
ness for the different stimulus situations: 
the subject scoring highest (least steady), 
the subject scoring lowest (most steady), 
and the subject making the middle score. In 
cases where more than one subject made the 
highest, lowest, or middle score, the first 
subject appearing in Table I with that score 
was selected. 

The steadiness curves for the three men 
subjects and the three women subjects se- 
lected are shown in Figure 2, the curves on 
the left representing the performance of the 
men and the curves on the right showing the 
performance of the women. These curves por- 
tray the individual differences in steadi- 
ness resulting from the different stimli. 
It can readily be seen that S 19 (\M-left) 
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was Only mildly affected by a short period 
of exercise; praise caused only a slizht ad- 

ditional decrease in steadiness, but re- 
proof had a remarkably steadying effect by 
increasing the steadiness to a degree great- 
er than existed for the norm. The long ex- 
ercise period (stimulus 6) produced little 
change (5 contacts) over the norm. § 17 (MM 
left) was affected more by the two exercise 
stimuli, and praise further decreased the 
steadiness to a marked extent. Reproof af- 
fected S 17 to approximately the same extent 
as it affected S 15, and the long exercise 
stimulus produced slightly more effect. S 2 
(M-left) decreased only slightly in steadi- 
ness for the first exercise stimilus, but 
the second exercise stimulus caused @ pro- 
nounced change in steadiness. Praise did 
not cause any further decrease in steadiness: 
but reproof affected the emotional status, 
| increasing the steadiness to approximately 
| that of the norm. ‘he long exercise stimu- 
lus decreased steadiness more than that 
| Of either S 19 or S$ 17, as is show by the 
curves. 
S & (Vericht) was only mildly affected 

by Stimuli 2, 3, 4, and 5, steadiness de- 


tha 
valS 


creasing only slishtly after the first ex- 
t ‘radually increasing 
to Stimulus 6 where a 

i shown. 


| ercise stimulus, then 


, 
in steadiness up marked 
| decrease in steadiness was s 4 


Contacts 





ia “a Women ‘ 
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(W-right) decreased in steadiness for the 
first exercise stimulus, but steadiness in- 
creased for the second exercise period. 
Praise and reproof produced only a slight 
change in the emotional status, the curve 
rising slightly for praise and falling to 
almost the same extent for reproof. 

For Stimulus 6, however, S 4 was almost 
as steady as for the norm. S 1 (W-right) 
was affected considerably more for the 
first two exercise stimuli than was either 
$4 or S 18. Praise, however, produced no 
change in the emotional status, but reproof 
sent the curve upward higher than for either 
S's 4, 5, or 6.. For Stimulus 6, steadiness 
increased equal to that recorded for Stimu- 
lus 2. 

Although none of the individual curves 
for the other thirty-nine subjects is ex- 
actly similar to those presented in Figure2, 
some curves showing greater and some less 
individual variation in change in emotional 
status from one stimulus to another, a fur- 
ther study of these individual curves does 
not edd materially to our knowledge of how 
individual learners are affected by the dif- 
ferent stimuli presented in this study. 

A further study of the effect produced 
on the learners by the five different stimu- 
lus situations, however, does indicate that 
the emotional status of men and women is af- 
fected and to different extents by the same 
stimulus. A study of the six sample cases 
(Figure 2) shows that steadiness in both 
the men and the women was decreased (in dif- 
ferent amounts) by a short period of light 
exercise. A longer, more strenuous period 
of exercise, however, produced markedly dif- 
ferent results; for two of the women S's (4 
and 8) steadiness was increased, while for 
S 1 steadiness was decreased. For the three 


men S's, all showed a decrease in steadiness, 


S 19 being affected very little, S$ 17 toa 


greater extent, and S 2 to a mich greater de- 


gree. 

Again there is noted a peculiar situa- 
tion when the performances of men and women 
are compared on Stimulus 4 (praise). ‘rhe 
curvs for S 17 (M) rose abruptly, S 19 went 
up slightly, and S 2 retained the same level 


as for Stimlus 3; S 8 (W) went dow slight- 


ly following praise, S 4 rose slightly, and 
S 1 retained the same level as for Stimilus 
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3. ‘The three curves for Stimulus 5 (reproor) 
for the men drop rapidly and to approximately 
the same extent. For the women the curve for 
S 1 rose considerably; S 4 fell slightly, and 
S 8 retained the same level as for praise, 
Sex differences again appeared in Stimulus 6, 
All of the curves for the three men rose; 

S 17 and S 19 went up but slightly, while S2 
rose to a marked extent. For the women, S's] 
and 4 registered a considerable increase in 
steadiness, while S 8 showed approximately 
the same decrease in steadiness. 

After noting how the different stimuli 
affect the steadiness of the subjects one is 
interested in the relationship existing be- 
tween the increase or decreases. Inan at- 
tempt to secure an adequate measure of this 
factor, the correlation coefficients in ‘'a- 
ble II were computed. ‘hese were obtained by 
correlating the contact scores (Table I) for 
the different stimulus situations. The rank 
difference method of finding correlations was 


TABLE II 


CORRELATION COEFFICIENTS BETWEEN 
STIMULUS SITUATIONS 











Stimulus Men Women 

Situation r P.E.r| N r P.E.r N 
1-2 -79 -06 | 23 |.85 04 22 
2-3 48 ell | 23 |.76 06 22 
3-4 224 e14 | 23 |.21 13 22 
4-5 27 e1l3 | 23 | .60 -10 22 
5-6 39 o13 | 23 |.42 12 22 
1-6 42 -1l2 | 23 | .84 04 22 
2-6 225 o1l4 | 23 |.77 06 22 
3-6 50 ell 23 | .86 04 22 
4-6 29 e13 | 23 |.28 14 22 
3-5 30 13 | 23 | .66 -08 22 
1-4 239 olAZB | 25 |.82 13 22 
1-5 76 -0O6 | 23 | .46 -12 22 























used and the resulting P's were transmuted 
into product-moment r's (5, p. 191). Only a 
few of these correlations may be considered 
substantial or high and by far the majority 
are low. It is observed that the few high 
correlations exist between the same type of 
stimulus situations; the correlations between 
exercise and praise and between exercise and 
reproof are low, indicating that changes in 
the emotional status of the subjects were 
due to the effect of the praise or reproof 








and not to the effect of exercise, 
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Group differences. A comparison of 
group performance offers a more satisfactory 
and perhaps a more reliable means of study- 
ing the changes in the emotional status of 
the subjects resulting from different stim- 
ulus situations. Studying the sex groups 
separately (Table I and Figure 3) it will 

be seen that the average number of contacts 
made for each stimulus situation by the men 
varied considerably from one stimulus situ- 
ation to another. For purposes of group 
comparisons, it adds to the clearness of the 
discussion if the normal steadiness is: 

(1) compared with the changes in steadiness 
resulting from the exercise Stimuli 2, 3, 
and 6; (2) with praise and reproof; and 

(3) between exercise and praise and reproof. 
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Figure 3. Average Number of Con- 
tacts Made by Men and Women S's for Each 
Stimulus Situation. 


Men. (1) The average number of contacts| 


for the norm stimulus was 20; for Stimulus 2, 
25; for Stimulus 3, 32; and for Stimulus 6, 
36. The difference in number of contacts be- 
tween Stimuli 1 and 2 is 5; between Stimuli 

2 and 3 is 7; between Stimuli 3 and 6 is 3; 
between Stimuli 1 and 6 is 16; and between 
Stimuli 2 and 6 is 1l. ‘The first point to 
be considered in relation to these differ- 
encesis their reliabilities (5, pp. 128-133). 
In Table III the statistical reliability of 
the differences between the different stim- 
li is computed. Thus, it is seen that the 
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differences between the norm and the three 


exercise stimuli are all statistically reli- 
able. This shows that exercise (basketball) 
will produce a difference in steadiness, 
whether the period of exercise be of short, 
long, or medium duration. 

(2) Ihe average number of contacts made 
on Stimulus 4 was 41 and on Stimulus 5, 29. 
Comparing these averages with the norm (20) 
shows that steadiness was decreased by praise 
21 contacts and by reproof 9 contacts. Re- 
ferring to Table II, we find that these dif- 
ferences are reliable. A difference of 12 
contacts was made between praise and reproof 
and this difference is also reliable. Both 
praise and reproof decrease the steadiness 
of the S's, and the decrease by praise and 
reproof is a reliable difference. 

(3) The most interesting comparisons of 
differences between the various stimulus sit- 
uations used are those between the exercise 


| Stimuli 3 and 6, and Stimuli 4 and 5, re- 


proof.+ The difference between the average 
number of contacts for Stimuli 3 and 4 shows 
that steadiness which is decreased by nine con- 
tacts is reliable; between Stimuli 4 and 6 
steadiness is increased, as is shown by a 
difference of five contacts. The obtained 
difference of five contacts is 2.00 times the 
standard deviation of the difference, 2.50, 


| showing that the chances of praise (after a 


short period of exercise) causing more change 
in the emotional status of the learner than 
does a longer period of more strenuous ex- 
ercise are 980 in 999+. ‘here is a differ- 
ence of three contacts between Stimuli 3 and 
5 which is almost statistically reliable (997 





in 999+), indicating that there is a tendency 
for steadiness to be decreased more by re-=- 
proof than by a period of exercise of medium 
length. Between Stimuli 5 and 6 there is a 
difference of seven contacts--this shows 
there is virtually a true tendency for the 
long period of strenous exercise to cause a 
greater decrease in steadiness than does re- 
‘proof. 

Women. (1) The average number of con- 
tacts made by the women for each of the ex- 
ercise stimuli is as follows: Stimulusl, 18; 
| Stimulus 2, 22; Stimilus 3, 23; Stimulus 6, 
21. The differences between the averages of 
these stimuli are not so great as they were 








1. The stimulus situations for 5, 4, and 5 were exactly the same except that praise was added to Stimulus 4 and reproof 


to Stimulus 5. 
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TABLE III 
RELIABILITY OF THE DIFFERENCES BETWEEN DIFFERENT STIMULI 
MEN 
Stimulus Situations 
- a. 2 fs 6 1 6 2 6 3 6 
Ave. | 20 25 | 22 36 20 36 25 36 ae 36 
em. sre | ss 6.71 | 10. 8.53 | 5.76 | 8.53 | 6.13 | 8.53 | 6.71 | 8.53 
S.D. Ave. |1.20 | 1.26 | 1.40 | 2 1.78 | 1.20 | 1.78 | 1.28 | 17: | 1.40 | 1.78 
Diff. in, 5 7, |® 7 16 11 4 
8.D.diff. -80 | 2.20 1.07 2.41 1.68 1.91 1.62 
D 6.25 | 3.91 | 6.54 2.90 9.52 5.76 2.47 
S.D.daiff. 
Chances in 999+ 999+ | 999+ 9398 999+ 999+ 993 
1000 of 
ia Sas eee 
3 5 4 6 #. 1. 4 1 5 
Ave. se | 29 | 41 | 41 20 29 
S.D. 6.71 |11.87 |10.50 10.50 | 5.76 |11.87 
S.D. Ave. | 1.40_ 2.47 | 2.19 | 1 20| 2.19 | 1.20 | 2.47 
Diff. 7 5 21 3 
S.D.diff. 2.45 2.50 2.05 1.74 
D 2.86 2.00 10.24 3.45 
S.D.aiff. 
Chances in 997 980 999+ 999+ 
1000 of 
real diff. 
TABLE IV 
RELIABILITY OF THE DIFFERENCES. BETWEEN DIFFERENT STIMULI 
WOMEN 
| Stimulus Situations 
j 21 2 3 4] 5 6 1 6 2 6 3 6 
Ave. 18 22 23 20 | 24 21 18 21 22 21 23 21 
S.D. 6.23 | 9.68 | 6.24 | 5.45 | 6.50 | 7.92 | 6.33 | 7.92 | 9.68 | 7.92 | 6.34 | 7.92 
S.D. Ave. | 1.35 | 2.06 | 1.35 | 1.16 | 1.39 | 1.69 | 1.35 | 1.69 | 2.06 | 1.69 | 1.35 | 1.68 
Diff. 4 1 3 4 3 3 1 2 
S.D.diff. 1.15 | 1.35 | 1.36 | 1.11 | 1.68 -85 1.31 1.76 
D 3.50 -74 | 2.17 | 3.60 | 1.79 3.53 -76 1.14 
8.D.daiff. 
Chances in 999+ 770 985 999+ 960 999+ 770 870 
1000 of 
real diff. 












































Ave. 8 
8.D. 6.34 | 6.50 | 5.45 | 7.92 | 6.33 | 5.45 | 6.33 | 6.50 
8.D. Ave. 1.36 | 1.39 | 1.16 | 1.69 | 1.35 | 1.16 | 1.35 | 1.39 
Diff. 1 1 2 6 
8.D.diff. 1.13 1.76 1.40 1.42 

D 88 57 1.43 4.22 
8.D.diff. 
Chances in 820 7.15 925 999+ 
1000 of 
real diff. 
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TABLE V 


GROUP VARIATIONS BETWEEN DIFFERENT STIMULI 























ABYC MN RE DON 























7" eo Increase in No. of Contacts Decrease in No. of Contacts Same 
Men Women | Men | Women u | W 
ae Stimuli No. Increase No. Increase No. Decrease No. Decrease No. Mo. 
. ‘1 to 2 21 1 to 10 17 1 to 18 1 1 2 | 1 told 1 3 
2 to 3 20 1 to 16 14 2 to 13 1 1 7 i toe 2 1 
3 to 4 17 1 to 34 4 3 to 19 5 1 to ll 17 | 1 to ll 1 1 
4 to 5 1 15 15 3 to 16 22 2to 33 | 6 | 2to 7 0 1 
5 to 6 17 1 to 20 5 1 to 13 2to 6 | 16 | ltolé 0 1 
1 to 3 23 3 to 19 18 1 to 14 ) 1 | 2 0 3 
1 to 6 23 2 to 34 16 1 to 14 0 5 lto 4 0 1 
3 to 5 10 2 to 10 11 1 to 12 1 to 20 | ll lto 8 0 ie) 
3 to 6 18 1 to 25 5 1 to 11 4to 9 | 17 | 1 to 11 2 0 
4 to 6 8 1 to 14 11 1 to 13 Btos | 9 | 1 to 2l 0 2 




















for the men (Table IV); however, the differ- 
ence (four contacts) between the norm and 
Stimulus 2 (three contacts), and between 
the norm and Stimulus 6 are reliable. This 
shows that there is a true tendency for both 
the short and long periods of exercise in 
the water to decrease steadiness. The dif- 
ference of one contact between Stimuli 2 
and 3 shows that the probability of a long- 
er period of exercise in swimming producing 
a decrease in steadiness is only slightly 
more reliable than chance. A much longer 
period of exercise in the water increased 
steadiness, but the obtained difference of 
two contacts is not sufficient to show a 
true tendency for this to occur. 

(2) The average number of contacts made 
on Stimulus 4 was 20, and on Stimulus 5, 2. 
Comparing these averages with the norm (18) 
shows that steadiness was decreased but 
slightly“by praise (two contacts), but to a 
more marked extent by reproof (six contacts). 
The obtained difference of two contacts be- 
tween the norm and praise is not entirely 
reliable, but the reliability is high enough 
(925 in 999+) to show a tendency for praise 
to reduce the steadiness of the subjects; the 
difference of six contacts between the norm 
and reprocf is entirely reliable, which fact 
shows a true tendency for reproof to reduce 
the steadiness of the subjects; the differ- 
ence of four contacts between praise and re- 
proof is statistically reliable, showing 
that there is a true tendency for reproof to 





decrease the steadiness of the S's to a 
greater extent than does praise. 

(3) A comparison of the average number 
of contacts made on the exercise Stimuli 3 
and 6 and Stimli 4 and 5, praise and re- 
proof, presents an interesting study. ‘The 
difference between Stimuli 3 and 4 is three 
contacts, indicating that praise causes an 
increase in steadiness. -The obtained dif- 
ference of three is, however, only 2.17 
times the standard deviation of the differ- 
ence, 1.39, so the chances of this increase 
in steadiness representing a true tendency 
are 985 in 999+, The obtained difference of 
one contact between Stimuli 4 and 6 is suf- 
ficiently large to insure more than an even 
chance (715 in 999+) that a long period of 
strenuous exercise in the water will tend to 
reduce steadiness more than praise. The 
difference of one contact between Stimuli 
3 and 5 is only .88 times the standard devi- 
ation of the difference, 1.13, so the chanc- 
es of this increase representing a true 
tendency are 820 in 999+. Between Stimuli 
5 and 6 there is a difference of three con- 


| tacts, which indicates that long, strenuous 


exercise in the water will increase steadi- 
ness more than reproof. This increase rep- 
resents a true tendency, 960 chances in 999+, 
Another method of showing group varia- 
tions in the effect of various stimulus sit- 
uations upon steadiness is presented in ‘la- 
ble V. ‘this table shows the number of sub- 
jects whose steadiness was increased, remained 
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TABLE VI 


RELIABILITY OF SEX 


DIFFERENCES 





Ave. 

8.D. 

8.D. Ave. 

Diff. 

§.D.aiff. 
D 

S.D.diff. 

Chances in 


1000 of 
real diff. 
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the same, or was decreased by the various 
stimulus situations for which they were 
tested. 

Sex differences. A comparison of the 
effect produced on the emotional status of 
the men and women groups shows: (1) that 
the same stimuli affected the steadiness of 
men and women to different extents and that 
(2) the women were steadier than the men. 

An examination of the steadiness curves 
(Figure 1) shows that for all six of the 
stimulus situations, the women were steadier 
than the men, ‘Ihe norm for the men was two 
contacts higher than for the women, which 
shows that the chances of this difference 

in steadiness representing a true tendency 
are 860 in 999+ (Table VI). A short period 
of light exercise decreases the steadiness 
of men more (three contacts) than of women. 
The chances that this difference is a true 
tendency are 885 in 999+. The obtained dif- 
ference of nine contacts between the steadi- 
ness of men and women on Stimulus 3 is sta- 
tistically reliable, showing that there is 

a true tendency for a period of more strenu- 
ous exercise to decrease the steadiness of 
men more than of women, Following praise 
the men registered a degree of unsteadiness 
of twenty-one contacts greater than the 
women, which is a true tendency for men to 
be less steady after praise than women. The 
steadiness of both the men and the women was 
affected by reproof, the men by five more 
contacts than the women. ‘I'he chances that 
this difference rperesents a true tendency 
are 960 in 1000. ‘the difference of fifteen 








contacts between the degree of steadiness 
for Stimulus 6 is entirely reliable. 

An examination of Figure 3 shows that 
Stimuli 2 and 3 caused a decrease in steadi- 
ness of both men and women; that Stimlus 4 
(praise) caused a marked decrease (nine con- 
tacts) in the steadiness of the men, while 
the steadiness of the women was increased 
three contacts. Exactly the opposite is 
shown for Stimulus 5 (reproof); the steadi- 
ness of the men was increased by three con- 
tacts while that of the women was decreased 
by four contacts. For Stimulus 6 the steadi- 
ness of the men was decreased, but not to 
the same extent as in the case of praise, 
while the steadiness of the women was in- 
creased as compared with the results in the 
case of reproof. 


INTERPRETATION OF RESULTS 


It is significant: (1) that the three 
different types of stimuli--exercise, praise 
and reproof--used in this investigation pro- 
duced changes in the emotional status of the 
learner as is shown by the differences in 
steadiness; (2) that the same stimuli af- 
fected the emotional status of the subjects, 
producing different degrees of steadiness; 
(3) that the same stimuli produced reliable 
sex differences in steadiness; and (4) that 
the results possess a high degree of re- 
liability. 

The first three of these results com 
pare favorably with the findings of other 
investigators who have studied the effects 








september, 1933 


of various stimuli on the emotional status 
of the learner. The last finding, however, 
is not substantiated by earlier investiga- 
tors. Kellogg (11, p. 151) in discussing 
the effect of a "startle" type of stimulus 
on men and women says: “It is by no means 
apparent from our results that the women 

as a class were markedly more affected than 
the men." Skaggs (16) says that women are 
more emotional than men because they were 
more afraid of the electric shock than were 
the men. This conclusion, however, was 
based on introspective reports and upon 

the general observations of the investigator, 
and not upon a study of the objective data. 
A study of his tabulated figures indicates 
that the men were more upset than the women. 

That women are more emotional than men 
seems to be the common opinion among laymen. 
The data here reported do not support this 
common belief and seemingly disprove the 
data of Skaggs. 

It is to be remembered, however, that 
there were a number of factors which were 
not entirely controlled in this investiga- 
tion, any one of which possibly could have 
been responsible for the sex differences. 
These partially uncontrolled factors were as 
follows: 

1. The types of exercise participated 
in by the groups were of a different nature. 

2. The length of the exercise periods 
differed: 

a. Two hours and thirty minutes for 
the men. 

b. Forty minutes for the women. 

3. The praise and reproof were given by 
different persons: 

a. By the freshman basketball coach 
for the men, 

b. By the swimming instructor for 
the women. 

4. There was a difference in the aims 
of the two groups: 

a. The men were striving to earn 
the freshman basketball numeral and the 
prestige that accompanies it. 

b. The girls were merely fulfilling 
a part of the physical education require- 
ments of the university. 

Such adjustments as were made in the 
attempt to control these factors were de- 


|al status are due to fatigue or to 
| physiological changes must be determined by 
| other investigators. 


| cured from the women. 
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scribed in the explanation of the experimen- 


| tal conditions. 


The individual changes in the emotional 


| Status of the men indicate that, in general, 
| the subject with a high norm (less steady) 


also will rank high after a short period of 


| light exercise, and that subject with a low 
norm (steady) will rank low after a_ short 
| period of not too strenuous exercise. For 


a longer and more strenuous period of exer- 
cise, knowledge of the normal steadiness or 


| lack of steadiness of the learner possesses 
| little reliable predictive value (Table II). 
| The data do not give the reasons for this 


condition. Whether the changes in emotion- 


other 


Considering group performance, praise 


| caused a decrease in steadiness in the men 
_and reproof produced an increase in steadi- 


ness. The uniformity by which the subjects 
reacted to these stimuli and also the re- 


| liability of the obtained differences indi- 
| cate that they possess valid predictive 
| value. 


The findings concerning the effect of 


| praise and reproof on the steadiness of the 
| Subjects might be criticized on the basis 

_that the men were generally praised on "off 
_days" (when they were normally unsteady) or 


when for some other physiological reason 
they could not play as well as normally, 
thereby decreasing their steadiness, and 


| that they were reproved on the days when 


they were at their best. The same criti- 
Cism might be directed at the results se- 
Our testing proce- 


| dure, however, would offset the probability 


that this criticism might be just and the 


| correlations (Table II) between the differ- 


| 


ent stimuli show that there was no prefer- 
ence of days for praise or reproof. 

The data show that a short period of 
exercise in the water decreased the steadi- 
ness of the women, while a longer period of 
exercise tended to increase their steadiness 
to approximately a normal degree. The fact 
that a few minutes of exercise in the water 
decreased the steadiness of the women while 
a longer period of exercise increased their 


| steadiness is not answered by the data. This 
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condition may be due to the shock (chill) 

of first entering the water, to adaptability, 
or to some other physiological factor. The 
findings on this point are, however, in har- 
mony with the common opinion of swimming in- 
structors and swimming participants who be- 
lieve that swimming is the best type of ex- 
ercise in which to participate because of 
its steadying effects on the "nerves" of 
the individual. 

The sex groups as well as individuals 
of the group are affected to a different ex- 
tent by praise and reproof. Women under the 
influence of a stimulus of praise are more 
steady than men. When reproved the condi- 
tions are exactly reversed; the women are 
less steady than men. The fact that reproof 
produced a steadying effect on the men might 
be due to the fact that the criticism was 
constructive, which acted more as a check on 
the learner's progress than it did to pro- 
duce an effect on his emotional status. 
This, however, did not explain the differ- 
ences existing in the different effects of 
praise and reproof on the sexes. The fact 
that women are steadier than men in every 
stimulus situation involved in this investi- 
gation seems to be an important finding, and 
the fact that these sex differences possess 
a high degree of reliability should add to 
the significance of the results. 

The results of this investigation seem 
to point to many pertinent facts concerning 
basketball coaching. Whether the emotional 
status of the players is closely related to 
the length of practice periods, to the time 
a team should be on the floor before a game, 
when practice on "free throws" should be 
conducted, the length of scrimmage sessions, 
and other problems of a like nature, the 
author will attempt to determine by a more 
extensive investigation. 

If subsequent investigations verify the 
results of this study, showing that swimming 
tends to produce an increase in the steadi- 
ness of the individual, then these results 


would be of considerable practical importance; 


swimming should be recommended for "nervous" 
individuals, 


SUMMARY 


To study the effect of various stimuli 
on the emotional status of the individual as 
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expressed through changes in steadiness, 45 
subjects-- 23 men participating in freshman 
basketball and 22 women participating in 
swimming--were given a series of steadiness 
tests, during which various degrees of ex- 
ercise stimuli, praise stimuli, and reproor 
stimuli were presented without the learners 
being aware of the purpose of the investiga. 
tion. An improved plate-and-stylus tester 
of the Whipple type was used to measure 
steadiness. Three types of exercise stim- 
li were presented: (1) a short period of 
light exercise; (2) a longer period of more 
strenuous exercise;and (3) a long period 
of strenuous exercise. The exercise for 
the men consisted of basketball practice, 
and for the women, swimming practice. The 
praise (4) and reproof (5) stimuli of a nat- 
ural constructive nature were given under 
identical conditions. 








1. The steadiness of the subjects was 
affected by each of the five stimulus situa- 
tions. 

2. The emotional status of men and wo- 
men is affected to different degrees by the 
same stimuli. 

3. Exercise in basketball affects the 
emotional status of men by decreasing their 
steadiness, whether the period of exercise 
is of long, medium, or short duration. 

4, There is a wide difference in the 
normal steadiness of individuals. 

5. Steadiness is decreased less by a 
long period of strenuous exercise in rela- 
tion to the normal steadiness of the in- 
dividual than by a medium period of exer- 
cise. 

6. A short period of exercise in the 
water decreases the steadiness of women. 

7. A longer period of exercise in the 


| water produces little additional effect on 





the emotional status of women, but a long 
period of strenuous exercise in the water 
increases the steadiness of women to a de- 
gree closely approaching their normal steadt- 
ness, 

8. The steadiness records, treated as a 
whole, possess an exceptionally high degree 
of reliability. 

9. Both praise and reproof produce 
statistically reliable differences in the 
emotional status of men learners. 

10. Steadiness is decreased less in men 
by reproof than by praise. 
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ll. Praise affects the emotional status 
of men by reducing steadiness to a greater 
Jegree than does either a long or short pe- 
riod of exercise. 

12. Reproof affects the emotional sta- 
tus of men by producing a higher degree of 
steadiness than that evidenced for either 
the medium or long exercise period. 

13. Women are steadier than men under 
normal conditions. 

14, Women are steadier than men under 
all stimlus situations used in this experi- 
ment. 

15. Praise acts on the emotional status 
of women to increase their steadiness, while 
for men it decreases their steadiness. 

16. Reproof increases the steadiness of 
men and decreases the steadiness of women. 

17. An exercise period of medium length 
in basketball reduces the steadiness of men 
while a similar period of exercise in the 
water has little effect on the steadiness of 
women. 

18. A period of long strenuous exer- 
cise in basketball decreases the steadiness 
of men while a similar period of exercise 
in the water tends to increase the steadi- 
ness of women, 

19. Praise affects the emotional status 
of women by increasing their steadiness. 

20. Reproof decreases the steadiness of 
women. 

21. A period of short and medium length 
exercise in the water decreases steadiness 
of women, but a longer, more strenuous peri- 
od tends to increase steadiness. 
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MEASURING ATTITUDE TOWARD THE HIGH SCHOOL * 
by 
H. H. Remmers, 
G. C. Brandenburg, 
and 
F. H. Gillespie 
Purdue University 


INTRODUCTION 


The proposition that desired education- 
al outcomes must somehow be measured rather 
than assumed no longer needs defense among 
competent educators. The fallacy inherent 
in the converse proposition has been too 
abundantly demonstrated in the experimental 
literature to require any extended argument 
here. Whether the scientific method cuided 
by an adequate social-educational philosophy 
be a sufficiently trustworthy tool, is still 
a matter of some debate among sincere and 
thoughtful educators, but any alternative 
is, we believe, in the nature of a retro- 
gression and a council of despair, 

Among the significant outcomes of edu- 
cational procedures pupil attitudes have re- 
ceived a great deal of attention by way of 
discussion in the educational literature, 
but attempts at measurement have been no- 
tably scanty. An enormous body of litera- 
ture has accumulated within the past two 
decades or so around the problem of measur- 
ing achievement in the traditional subject- 
matter of the curriculum, so that in this 
sector of educational Wissenschaft most rap- 
id and gratifying progress has been made, 
In the case of presumably less "tangible" 
outcomes or objectives such as attitudes, 
however, the situation, while distinctly 
promising, is far less satisfactory. 

The reasons for this state of affairs 
are somewhat complicated and obscure. There 
are, or at least have been, the sheer tech- 
nical difficulties of the theoretical de- 
velopment, definition, and practical con- 





struction of adequate measuring instruments 
for phenomena as complex as those subsumed 
under the term "attitudes." The rapid de- 
velopment of statistical tools and especial- 
ly the more recent work in psychology relat- 
ed to this problem have made possible rapid 
advances. Besides these technical difficul- 
ties, however, the problem is beset by the 
inertia, not to say recalcitrance, which ex- 
ists with reference to the invasion of what 
in psychology have been called the emotions, 
The measurement of affectivity is made dif- 
ficult by the existence of a variety of idols 
of the den and the market place, to use Ba- 
con's picturesque but highly descriptive 
terminology. "Vested interests"--sentimen- 
tal, social, economic, traditional--in the 
form of folkways have stood in the way of 
such measurement, and will likely continue 
to act as a brake upon progress in this di- 
rection, particularly with reference to ed- 
ucational, social, and economic changes which 
may be indicated as the result of such meas- 
urement, 

But even within a very narrowly defined 
status quo representing the resultant of 
present educational and social lines of 
force, so to say, there is still possible a 
considerable measure of science applied to 
the measurement of affectivity. That is, 
there are attitudes presumably having their 
origin in the social-educational continuum, 
upon which all individuals concerned would 
agree as being desirable of measurement. The 
attitude of the individual student toward 
the high school as such we conceive to be 
such an attitude. Inthe nature of the 





1. Division of labor in this study is as follows. The problem was suggested by Remmers who ie also responsible for the 
writing of the present paper. Gillespie under the joint direction of Brandenburg and Remmers constructed the atti- 


tude scale described in this paper as a master's thesis. 
data supplied by Gillespie and Delia L. Silance. 
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case it will be safest to begin a program of 
measurement which shall include attitude as 
a significant variable with one free fron 
possibly unpleasant repercussions. The scale 
to be described below is the first of a se- 
ries that is planned for experimental use, 
particularly in educational situations, 

The attitude scale is based upon the 
psycho=-physical principle that equally often 
noticed differences are equal, a principle 
very fruitfully applied to the scaling and 
measuring of attitudes by L. L. Thurstone 
and his students. In the references given 
at the end of this paper the interested read 
er who is not familiar with the details of 
the underlying theory and procedures will 
find adequate discussion of these. Such de- 
tails are omitted from this paper in the in- 


terests of space conservation. Suffice it 
to say that affectively toned debatable 
statements which an individual endorses are 


assumed to constitute a valid index of the 
individual's attitude toward the object de- 
fined by the attitude scale, in the present 
case attitude toward the high school. Fol- 
lowing are the two forms of the scale as 
they were used in investigations reserved 
for later publication elsewhere. 

The scale values (medians) and Q values 
(interquartile range) for the statements 
which were finally included in the scale on 
the basis of experimental evaluation with 
100 subjects are given in Table I. These 100 
subjects sorted 110 affected statements. 
From these, 50 were retained for two forms 
of the scale, on the basis of two criteria: 
(1) scale values covering approximately the 
entire scale (eleven points) and (2) small 
Q values, showing absence of ambiguity. 

The reliability of the scale was deter- 
mined from two independent samplings in two 
different high schools, One sampling of 179 
pupils, approximately two thirds of the pu- 
pil population in a small town high school, 
yielded a reliability coefficient for one 
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| form of .753 + .02. Another sampling of 300 
| from a total high school population of ap- 
proximately 1,250 in a city of about 30,000 
inhabitants cave a reliability of .727 + £2. 
These reliability values become .859 + .02 
and .841 + .02 respectively when stepped up 
by the Spearman-Brown formula for the rella- 
bility of the sum of the scores on both 
forms of the scale. This value compares fa- 
vorably with the reliability of most stand- 
ardized educational measuring instruments.? 

The scale is scored by averaging for 
each subject the scale values of all items 
which he had placed a check mark, a proce- 
dure quickly accomplished by the aid of an 
adding machine and a stencil on which appear 
the scale values corresponding to each of 
the items in the scale, 





TABLE I 
SCALE VALUES (MEDIAN) AND VARIABILITY VALUES 
(INTERQUARTILE RANGE) DETERMINED FOR TWO 
FORMS OF THE SCALE FOR MEASURING ATTITUDE 
TOWARD THE HIGH SCHOOL 








Statement | Scale Value | Interquartile Range Value 
Number Form A | FormB Form A ] Form B 
1 | 71 | 06] 22 Tle 
2 8.8 6.4 1.6 1.2 
3 | 3.2 7.2 2.5 2.4 
4 3.5 | 81 2.4 | 1.5 
5 9.0 7.3 1.8 1.8 
6 2.4 4.5 1.7 2.5 
7 2.2 | 2.2 2.2 2.4 
8 5.3 | 6.1 1.6 1.6 
9 | 6.8 | 9.2 1.7 1.8 
10 | 7.2 | 5.9 1.2 Ask 
ll | 9.4 | 44 2.2 2.2 
12 7.2 | 8.8 2.8 1.7 
| 13 | 6.0 | 8.9 0.9 2.2 
14 | 4.5 | 3.6 2.9 2.2 
15 2.8 | 2.7 2.5 1.9 
16 1.4 | 2.9) 2.4 2.0 
17 | 0.6 | 5.8 1.8 1.9 
18 | A | 86 1.8 0.4 
19 4.4 | 1.3 1.9 2.6 
20 | 10.8 | 3.8 1.8 2.1 
21 5.7 2.0 1.9 1.9 
| 22 3.5 |10.1 1.9 2.0 
23 1.5 1.2 2.0 2.2 
| Ba 9.8 | 9.7} 2.0 1.6 
L 25 9.5 9.4 2.0 fe 1.3 














1. Gillespie's thesis gives the details of the experimental procedure in scale construction and conteins the scale and — 


Q values for all the original 110 statements. 


The thesis is on file in the Purdue University Library. 


2. In another place is discussed the difference in reliability coefficients to be expected from two different methods of 
determining reliability—the "split-test" and "equivelent forms." See H. H. Remmers, "A Possible Experimental Error 


in Determining the Overlap of Two Correlated Variable," to appear in an early number of the Journal 
chology. The theoretical argument made in this article has since been verified experimentally, and 
shorty. The reliabilities reported for many analogous educational messuring instruments 
split-test procedure and are without doubt in general spuriously high. 
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HIGH SCHOOL ATTITUDE SCALE 
Form A 


Please fill in the blanks below. (You may leave the space for your name blank if you wish.) 


> WN PY 


wn 


6. What life work do you expect to enter? 
7. Have you ever failed in a school subject in the grades? 


In high school ?.... sestectereccecessseeeeeeeeee AE SO, in What subjects? 


Name .. 
Age when school started this year 

Grade in school...... 

Boy or girl................ nianmnnsaabens Lcataindnienoation 


Occupation of parent or guardian.. sesatereaialencyaacas ; 


Below is a list of twenty-five statements about school. Place a check mark before each statement 
with which you agree, and leave unmarked those with which you disagree. This test will in no way 
affect your standing in school. 


1. 
_ & 
3. 


ur 


I would rather go to school than work. 

I like to do school work. 

My classes are very uninteresting. 

It costs too much to go to school. 

It helps one to get a job if he has a high school diploma. 

I can learn more working on a job than in high school. 

A high school graduate is often worse off morally than he was before going to high school. 
High school is all right, but I don’t like it. 

High school has its drawbacks, but I like to go. 

I would rather go to high school than to stay at home. 

A high school education will help one to be a good citizen. 

I don’t like to associate with people who haven't a high school education. 
I don’t care about high school, but I think one ought to. 

High school may be all right, but I don’t think it does any good, 

Most people would be just as well off not to go to high school. 

High schools are a waste of the taxpayers’ money. 


I think every high school in the country ought to be closed and the teachers put to work 
at something useful. 


High school teachers are parasites on the community. 

If one’s parents need him at home he ought not to go to high school. 

A high school education is worth a million dollars to any young person. 

Sometimes I like high school and sometimes I don’t. 

There are too many rules and regulations in schools. 

In high school pupils learn to disrespect everything of high idealistic character. 

The high schools do more to develop high standards of conduct than any other institution. 
High school develops self reliance. 


Place a check (\/) below along the line to indicate your attitude towards school. A is extremely 
favorable and K is extremely against. F is neutral and the other letters indicate graded steps between, 





ABCDEFGHIJK 
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HIGH SCHOOL ATTITUDE SCALE 
Form B 


Please fill in the blanks below. (You may leave the space for your name blank if you wish.) 


Age when school started this year.... 

Grade in school 

Boy or girl 

Occupation of parent or guardian... 

What life work do you expect to enter ?... 

Have you ever failed in a school subject in the grades? 


a I Bcctcaipeicdieepeeencctcetitsehcacandansiciin ... If so, in what wilde? 


Below is a list of twenty-five statements about school. Place a check mark before each statement 


with which you agree, and leave unmarked those with which you disagree. This test will in no way 
affect your standing in school. 


School is like a prison. 

I have a lot of fun in school. 

School is all right. 

My teachers always treat me fairly. 

I like to go to school to be with other people. 

Many of our great men have no high school education. 

I hate most school work. 

Some things about high school are all right. 

High school training develops personality. 

High school is a good thing for some people and a bad thing for others 
I would just as soon stay at home as go to school. 

All the better class of people have high school educations. 

The high schools lift the plane of sportsmanship in a community. 

Too much money is being spent on the high schools for the benefit received. 
The high school teaches mostly old useless information. 

They won't teach things one really wants to know in high school. 

If one has plenty of money it may be all right to go to high school. 

I haven’t any definite like or dislike for high school. 

Any old fogy knows more than a high school graduate. 

The kindest and best people I know don’t have high school educations. 
High school cramps and dwarfs one’s personality. 

America could not stand as a nation if it were not for our high schools. 
Our high schools teach immorality and indecency. 

High school training develops high ideals in the pupils. 

High school develops loyalty. 


Place a check (\/) below along the line to indicate your attitude towards school. A is extremely 
favorable and K is extremely against. F is neutral and the other letters indicate graded steps between. 
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THE MEASUREMENT OF ATTITUDES TOWARD CURRENT 





POLITICAL AND ECONOMIC PROBLEMS AMONG 
JUNIOR AND SENIOR HIGH SCHOOL PUPILS 
by 
Milo L. Whittaker+* 


PROBLEM FOR INVESTIGATION 


The purpose of this study is to discow 
er what attitudes school children possess in 
regard to current political and economic 
problems; to determine the effect upon atti- 
tude of different environmental factors such 
as arural, town, or city background; to 
discover the relation of attitude to socio- 
economic status and to general intelligence; 
to investigate the relation of attitude to 
school grade; and finally to suggest what 
consequences for civic training might be de- 
duced from discoveries resulting from this 
study. 

Although there is some disagreement as 
to the meaning of the term "attitude", in 
this study it will be used to denote "the 
sum-total of man's inclinations and feel- 
ings, prejudice or bias, preconceived no- 
tions, ideas, fears, threats, and convic- 
tions about any specific topic" (14, p. 6-7). 

Students of social psychology are not 
in complete agreement as to the practicabil- 
ity of measuring attitude. Some take the 
position that verbalized responses do not in 
any real sense reveal attitude. The conten- 
tion of Bain (2) is that people tend to re- 
spond in conventional ways even though they 
do not have to sign their names, that most 
thinking is rationalization or stereotyped 
reaction, that questions are interpreted 
differently, and that words often refute ac- 
tion and vice versa. The fact that a high 
correlation frequently exists between suc- 
cessive answers to the same question does 
not establish the reliability of the method. 
On the other hand, Katz and Allport (5, p. 
vii-xi) believe that if the kind of informa- 
tion sought is such that a frank answer will 
not embarrass the subject, if the questions 
are so worded that the meaning is clear, and 


Northern Illinois State Teachers College 


if the results are interpreted with the full 
understanding that responses to certain types 
of questions may lack candor, the attitude 
questionnaire may be considered a legitimate 
device in research, 

Perhaps the clearest statement as to 
the possibility of measuring attitude has 
been presented by Thurstone and Chave (14). 
These authors recognize at the outset the 
| complexity of an attitude and the fact that 
it cannot be completely described by any sin- 
gle numerical index. This may be said, how- 
ever, of any measuring device, whether for 
measuring a table or the temperature of a 
room. "But this discrepancy between the in- 
dex and 'truth' is universal" (14, p. 8). 

Although a subject may be hiding the 
truth and practicing deliberate deceptions, 
it is nevertheless desirable to know what 
| people claim to believe about certain ques- 
tions. It is pointed out by both Katz and 
Allport and by Thurstone and Chave that we 
should not assume that an overt act is con- 
clusive proof that we have discovered a per- 
son's true attitude. A moment's considera- 
tion should convince us that people often deceive 
by action as well as by words. The overt 
'act of the politician in the presence of his 
constituency, the sanctimonious religionist 
| praying before the altar, or the salesman in 
| the presence of a prospective customer are 

probably no more accurate revelations of at- 
| titude than responses to a questionnaire. 





| 
| 
| 
| 
| 
| 
| 
| 





| 
| 


"All that we can do with an attitude 
scale is to measure the attitude actually 
expressed with the full realization that 
the subject may be consciously hiding his 
true attitude or that the social pressure 
of the situation has made him really be- 
lieve what he expresses. This is a mat- 
ter for interpretation. It is probably 
worth while to measure an attitude ex- 
pressed by opinions. It is another 








1. This study was directed at the Iowa Child Welfare Research Station by Dr. George D. Stoddard. 
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problem to interpret in each case the ex- 
tent to which the subjects have expressed 
what they really believe. All that we can 
do is to minimize as far as possible the 
conditions that prevent our subjects from 
telling the truth, or else to adjust our 
interpretations accordingly" (14, p. 10). 


In brief, the possibility of measuring 
attitude depends upon the ability of the 
investigator to present a situation which 
will in no sense embarrass the subject, to 
place before him questions about which he 
may be expected to have an opinion and 
which are stated in such language as may be 
clearly understood. 

Investigators should not become unduly 
disturbed by the discovery that overt behav- 
ior does not always correspond to the re-=- 
sults of attitude tests. Allport's (1) study 
of 375 Dartmouth students shows that about 
one half of the students who were found to 
be political radicals by the test failed to 
vote for the radical presidential candidate, 
Many of us need not go beyond our om expe- 
rience to be able to assert that radical tend- 
encies in politics do not always find ex- 
pression at the polls. The pronounced rad- 
ical whose political ideas have crystalliz- 
ed will vote the radical ticket, but there 
are no doubt many others who for various 
reasons do not express their radicalism at 
the polls. Nevertheless they may be classed 
as possessing radical political attitudes. 
It is a tribute to Allport's test that it 
discovered twice as many radicals as were 
found at the polls. On the other hand, Por- 
ter (10) found that students who had overtly 
identified themselves with pacifist activi- 
ties scored near the pacifist end of the 
scale, while R.O.T.C. officers scored near 
the opposite end. 

Neumann (7) applied his attitude test 
on international relations to a number of 
army officers, members of the Stock Exchange 
on Wall Street, a number of declared paci- 
fists, and other groups known to be pro- 
nounced in their international attitudes, 
The results were highly consistent. In view 
of the fact that it is a matter of common 
observation that one's agreement or disa- 
greement with opinions of a controversial 
nature affect his behavior, "there seems to 
be no reason why this behavior should sud- 
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denly become non-significant when it is made 
the subject of careful inquiry." (6) 

Most of the studies of attitude toward 
political and economic questions which have 
hitherto been made deal with college stu- 
dents or other adults, Little has been done 
toward investigation of the age when atti- 
tudes toward such problems begin to take 
form. Whether political and economic atti- 
tudes change significantly as children pass 
from grade to grade, and if so, in what di- 
rection, are problems worthy of careful in- 
vestigation. Are these attitudes affected 
to a measurable extent by social backgrounds? 
Is there a "rural mind" and an “urban mind" 
which may be detected by attitude tests? To 
what extent does the literature of the social 
studies to which children are exposed in 
school affect their political and economic 
attitudes? These are the chief problems 
which constitute the present investigation. 


METHOD OF STUDY 


Field to be Investigated 





This investigation was conducted along 
three lines, namely, attitude toward the 
present form of government in the United 
States, attitude toward capitalism as ex- 
pressed in our present economic system, and 
attitude toward the labor group. The study 
is concerned primarily with three distinct 
groups of pupils in grades seven to twelve 
inclusive, pupils living in a city of 20,000 
population, those living in small towns from 
300 to 600 population, and the third group 
consisting of pupils living in the open 
country. 

Care was taken to include in the study 
enly children who had lived at least three 
years in the group in which they were stud- 
ied. As a matter of fact, however, few cas- 
es were included who had not spent all or 
most of their lives in the group in which 
they were classified. A total of 612 pupils 
was studied, with slightly more than 200 in 
each group and at least 25 in each grade. 

One hundred seventy-six pupils in 
grades seven, eight, and nine, who have had 
from four months to three year's study of 
the Rugg social science books, were used in 
a subsidiary survey. This survey was made 
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for the purpose of discovering what differ- 
ence, if any, might be found in attitudes 
of this group as compared with other pupils. 

The tests used in this study were ad- 
ministered during the month of December, 
1932. 
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rect. Each statement is followed by a group 
of four conclusions or opinions introduced 


| by the statement, "The above statement shows." 


The pupil is to mark with a plus sign eachof 


| the four conclysions or opinions which he is 


willing to accept. He is to use a minus 


| sign for each one which he is unwilling to 


Constructing the Test} 





| he is unable to make a decision. 


The first step in the investigation 
consisted of the construction of an atti- 
tude test. The Watson (15) technique of 
measuring fairmindedness, with certain mod- 
ifications, was used. Watson's test is in 
five parts, three of which have been used 
in this study. These three parts are des- 
ignated as Tests A, B, and C, respectively. 

Test A, which is called a generaliza- 
tion test, consists of twenty-four state- | 
ments such as, "Members of labor unions are | 
dangerous persons," "Laws are more favorable | 
to the rich than to the poor," and "Million- | 
aires came by their wealth honestly." Each | 
of the twenty-four statements is preceded by | 
the words All, Most, Many, Few, No. The pu- | 
pil is instructed to underline the word 
which makes the statement one which he is 
willing to accept. Instead of being asked 
to mark the statement either true or false, 
he has the choice of five different respons- | 
es. The purpose of this test is to measure | 
the pupil's tendency to make broad generali- 
zations or overstatements. 

Test B, which is called degree of truth 
test, is a measure of intensity of opinion. 
Here the pupil is confronted by twenty-one 
statements similar to those in Test A, but 
each statement is preceded by the numerals 
1, 2, 3, 4, and 5. The pupil is instructed 
to underline 1 if he feels that the state- 
ment is completely or certainly true; under- 
line 2, if probably true; underline 3, if 
not certain or if it is an open question, If 
probably false, underline 4; and underline 5 
if the statement is believed to be complete- 
ly or certainly false. 

Test C, which is called testing the 
meaning of facts, is designed to reveal the 
tendency of pupils to draw unwarranted con- 
clusions from facts. This test consists of 
a series of eight statements which the pupil 
is told should be accepted as true and cor- 


| 
| 
| 


| must be an acceptance. 


accept, and a zero for each one about which 
An illus- 
trative statement together with the opinions 
or conclusions is presented below: 


| Statement of Fact 





The average net income of all corporations 
in the United States since 1921 was approx- 
imately eight billion dollers. These in- 
comes averaged less than five billion dol- 
lars from 1913 to 1915. 


| The above statement shows: 


1. That the corporations are whole-hearted- 
ly serving the public interests under the 
direction of the best and most intelligent 
citizens. 


2. That 
people. 


the corporations are robbing the 


3. That large corporations force their em- 


ployees to work harder and for less pay 
than in small industries, thus increasing 
their profits. 


4. The economic development of the country 
during the past few years has greatly fa- 
vored production by corporations to the dis- 
advantage of the small producer. 


The technique employed in Test C dif- 
fers materially from that of Watson in that 
his test presents an opportunity for marking 
but one of the several possible reactions to 
each statement of fact, and that marking 
In the present test 
the pupil is given an opportunity to react 
to each conclusion or opinion either by ac- 
cepting, rejecting, or admitting that he can- 
not decide. A further difference lies in 
the fact that in the Watson test there is at 
least one neutral statement under each 
statement of fact. This neutral statement 
is merely a colorless reiteration of the 
statement of fact. Since the pupil is to 
mark but one of the conclusions or opinions 
under each statement of fact, he may easily 





1. See appendix for the directions used in administering the 
not reproduced. 


test. Due to limitations of space the test itself is 
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take refuge behind the colorless statement, 
thereby concealing his attitude toward the 
problem. Instead of measuring the pupil's 
attitude of fairmindedness, it is possible 
that it measures only his ability to evade 
the issue. Test C, constructed by the writ- 
er, avoids this situation by requiring the 
pupil to react to each item. 

It is important to note that none of the 
tests used in this study force the pupil in- 
to a position of expressing an opinion eith- 
er stronger or weaker than is represented by 
his actual convictions, With a five-point 
gradation for his various reactions in Tests 
A and B, and with an opportunity to accept, 
reject, or register inability to decide in 
Test C, he is quite thoroughly safeguarded. 


Selecting and Classifying the Content 
Material of the Tests 





The statements comprising the three 
tests are based on the Rugg (11, 12, 13) so 
cial science textbooks. Because of the un- 
usually thorough research and experimenta- 
tion upon which the Rugg series is based, 
these books seemed to be a very desirable 
source from which to obtain the material for 
the tests. A rather complete list of the 
topics covered by these books was made and 
from these topics the statements comprising 
the preliminary tests were prepared. The 
preliminary tests contained about twice as 
much material as was desired for the final 
tests. The surplus material was eliminated 
in a manner to be explained later, 

In order that the tests should measure 
attitude in the three fields previously 
named, attitude toward the government, to- 
ward the present capitalistic system, and 
toward the labor group, it was necessary to 
have all items in the tests classified under 
these three heads. The items in Test A and 
Test B were classified by fifty-five gcradu- 
ate students in sociology at the State Uni- 
versity of Iowa, while the material in Test 
C was classified by forty-seven college 
teachers. All items were eliminated from 
the test upon which less than 60 per cent of 
the judges agreed as to classification, 
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Evaluating the Test Items 





In order that the results of an atti- 
tude test may be expressed in quantitative 
terms with a degree of accuracy which is at 
all satisfactory, it is necessary to assign 
a numerical value to each possible reaction 
presented in the test. We may say that one 
| object possesses more beauty than another 
| obyect, or that this person is more radical 
| than that person. Such statements are in a 


| crude sense measurements of attitude, but 





the results of such measurements are not to 
be regarded as possessing much practical 
' value. What we desire to know is: How much 
/more beauty does this object possess than 
another? How much more radical is this per- 
| son than that? When these "more" or "less" 
| concepts are thought of as being represent- 
ed on a linear scale, it is possible to as- 
sign numerical values to them. 

The items in Test A and Test B were 
evaluated by a modification of the Thurstone 
| technique (14). Each statement in Test A 
of the preliminary test was mimeographed up- 
on a separate ticket, once using the "all" 
prefix and again using the "no" prefix. 
These tickets were given to groups of stu- 
dents in both graduate and undergraduate 
classes, who by following carefully prepared 
instructions! were to place each statement 
somewhere upon an eleven=-point scale. This 
placement would represent the degree of fa- 
vorableness or unfavorableness toward the 
field represented by the statement. The 
center of the scale, No. 6, indicated neu- 
tral or nearly neutral attitudes; the ex- 
treme right, No. 11, most favorable atti- 
| tudes; and the extreme left, No. 1, most un- 
favorable attitudes. The judges pleced all 
other tickets on the scale in accordance 
with the degree of favorableness or unfa- 
vorableness indicated by the statement. It 
should be noted that the judges were not 
asked to indorse or reject any of the state- 
ments contained on the tickets. Their task 
was simply to evaluate the statements made, 
The tickets were then banded and numbered 
by each judge so that in tabulating them 
each ticket would be given the place as- 
Signed by the judge, 








1. See Appendix. 
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The scale value of each statement was 
4etermined graphically by means of a curve 
of accumulative proportions. 
4icates the direction and degree of attitude 
and the Y-axis represents the accumulative 
proportions. The point at which the curve 
crosses the 50 per cent line indicates the 
scale value of the statement based on the 
median opinion of the judges. The measure 
of ambiguity of the statement is indicated 
by the distance between the two quartile 
points of the curve, i.e., the distance on 
the scale between the scale value at which 
the curve has an ordinate of .25 and its 
scale value at which it has an ordinate of 
.75. This distance is called the Q-value. 

A statement with a large Q-value indi- 
cates that the judges did not agree well up- 
on the scale value of the statement. If the 
Q~value is too large, the statement possess- 
es little value as a measurement of attitude. 
Just how large the Q-value should be before 
a statement should be eliminated from the 
test is a matter of opinion. Thurstone (14) 
kept no statement whose Q-value was greater 
than 3.0. The Q-values in Thurstone's ex- 


The X-axis in- 
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| three middle statements. 
mining their values in the manner described 


69 


ing upon their direction from the neutral 
point. 
After having obtained score values for 


| the two extreme positions in each statement, 


to the 
Instead of deter- 


it was necessary to assign values 


above, a process which would have so over- 
burdened the judges as to make the procedure 


| impractical, the assumption was made that 


the scores of these positions in the test 


|would appear as a simple mathematical dis- 


tribution between the two extreme positions. 


'To illustrate, Statement No. 2 in Test A is 


‘or 0.1. 


periment were kept low, in part at least, by | 


discarding the work of certain judges if 
there was strong indication of carelessness. 
By this means the work of slightly more than 


his tabulated results, 

Although there was considerable evi- 
dence of confusion in the minds of the judg- 
es who evaluated the material comprising the 
tests under consideration, nevertheless, no 
reports were discarded. In consequence of 
this, the Q-values are on the whole larger 
than in Thurstone's experiment. 
Q-value was found to be 3.38, 

For purposes of scoring, the scale val- 
ues referred to above were translated into 
scores by determining the distance in steps 
from the neutral point on the scale, namely 
5.5, to the point represented by the scale 
value, Thus, if a scale value were 8.6 on 
the eleven-point scale, it would indicate a 
score of 8.6 minus 5.5 or a score of 3.1 in 
favorableness toward the field under inves- 
tigation. A scale value of 1.2 would indi- 
cate @ score value of 5.5 minus 1.2 or a 
score of 4,3 in unfavorableness. All scores 
are Stated as either plus or minus, depend- 


'"Criminals in our large cities are shielded 
| by officials." 
/en a score value by the judges of -4.3 while 
| the "No" statement was given a score of 4.5. 
| The distance between these two extreme posi- 


The "All" statement was giv- 


tions on this statement is 8.8. Since there 
are four steps from "All" to "No", the score 


| value to be assigned to "Few" will be dimin- 
| ished by 2.2 points (1.e., 8.8) or 2.3. The 


be 2.3 - 2.2. 
The statement "Most" receives its 
score in the same way. 

Score values for Test B were determined 


value assigned to "Many" wi 


/in the same manner except that the mimeo- 


graphed tickets placed in the hands of the 


| judges contained the direct statements as 
12 per cent of his judges was discarded from | 


given in the test and an equal number of 


| tickets in which the opposite of this state- 


| 


ment was expressed. Thus, the first state- 


| ment in Test B is, "The laws are more fa- 

| vorable to the rich than to the poor,” while 
| the opposite statement was, "It is not true 
| that the laws are more favorable to the rich 
| than to the poor.” 


The average | 
| judges, 


Tests A and B were evaluated by 116 
After an evaluation by fifty-five 
judges had been made, the scale values were 
computed. They were computed again after 


| the number of judges had been increased to 
| 116. 
| the average scale value was changed only 0.1 


This seemed a sufficient number since 


points by the addition of the last sixty-one 
judges. 
The nature of Test C is such that its 


| items could not well be evaluated by the 


Thurstone technique. The score values of 

the various items in this test were assigned 
upon the basis of the judgment of forty-sev- 
en teachers, forty-four of whom were college 
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teachers and three were teachers in high ‘procedure of sorting the tickets by the jude- 
school. The procedure was as follows: One | es wno evaluated the material in the tests 
set of twenty-five copies of Test C was constituted a further validation. The jude- 
marked by the writer as might have been es were instructed to discard all statements 
marked by a pupil except that none was mark- | which in their opinion did not pertain to 
ed with a zero, which indicates inability to | the subject under investigation. The pro- 
make 2 decision. Another set of twenty-five | cedure employed in determining the numerical] 


was marked just the opposite of the first | values of the various test items, which re- 
set. These fifty copies were then sent to sulted in discarding much of the original ma- 
the fifty judges, one copy to each judge, terial, constituted the last step in deter- 
together with a copy of mimeographed in- mining the validity of the test. 


structions for evaluating the test items. The reliability of each test was deter- 
The judges were to determine, first, mined by the self-correlation method. The 
the field to which each item in the test re- | odd-numbered items were correlated with the 
lated, whether Govermment, Capital, or Labor,| even-numbered and the Spearman-Brown formula 
and next, to determine whether the item in- | was applied to determine the reliability. 
dicated an attitude of favorableness or un- | The reliabilities were found to be as fol- 
favorableness toward this field. Complete | lows: Test A, .66 + .04; Test B, .64 + .04; 





returns were received from forty-seven judg- | and Test C, .61 + .04. These reliabilities 
es. All items were eliminated from this restrict the application of the tests to 
test upon which fewer than 60 per cent of group comparisons, 

the judges agreed. Score values were weight- 


ed on the basis of the percentage of agree- PRESENTATION AND DISCUSSION 


ment of the judges as follows: OF MEAN SCORES 





— ————— a 


Per Cent Score The two prime tasks in this section of 
Agreement Value 


} seats the report are to present the results of the 


60--69 | 1.0 application of the tests, which have been de- 
70- 79 | 1.3 scribed under the heading of method, on the 
80- 89 | 1.6 basis of mean scores obtained and to dis- 
90-100 2.0 





mnamianemmmanatiaamas cover what significant relationships, if any, 
exist among the scores of the various groups, 
A study of these mean scores may be 
made in two different ways. The mean plus 
Since there are no standardized tests scores of the various groups may be computed 
in this particular field, the validity of and compared with one another to discover 
the test could not be determined by correlat-| significant trends among those groups which 
ing it with other tests. Its validity rests | show a favorable attitude toward the field 
chiefly upon the source from which the con-| under investigation. Then, the means of the 
tent of the test was taken, namely, the Rugg | minus scores may be treated in the same man- 
social science textbooks. As is generally = ner. This would give at least a partially 
known, the Rugg texts are based upon what is | satisfactory picture of the groups under con- 
probably the most extensive research that sideration. The other alternative is to pre- 
has yet been conducted on what should con- | sent the findings in terms of the means of 
stitute the content of the social studies. net scores, i.e., the algebraic sum of the 
The content of the Rugg texts was ana- | plus and minus scores. Since our chief con- 
lyzed by the writer and a list of more than | cern is with total group attitude, it seemed 
100 problems was made. The present writer | desirable to adopt the second alternative 
discarded problems not germane to the imme- and to deal chiefly with net scores. 


Validity and Reliability 





diate interest in Capital, Labor, or Govern- The reader should recall that the inves- 
ment. The list was further reduced by a | tigation relates to three separate fields to- 
committee of the graduate faculty, under | ward which attitude is being measured, name- 
whose direction the study was begun. The | 1Y> the present form of government, the 
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present economic order, and the labor group. 
A plus score is always interpreted as ex- 
pressing a favorable attitude and a minus 
score an unfavorable attitude toward the 
field under investigation. For convenience 
of expression, the three fields will be re- 
ferred to as Government, Capital, and Labor 
respectively. 

By way of introducing the findings it 
may be said that each of the three groups, 
city, town, and rural, showed a pronounced 
attitude favorable to the Government in Test 
A as well as in Test B. (Owing to the fact 
that so many items in the preliminary mate- 
rial for Test C were eliminated through 
failure of the judges to agree, it was found 
that there was insufficient material in Gov- 
ermment and Labor to constitute a satisfac- 
tory test. Hence only the scores in Capital 
are considered in Test C.) Each group also 
showed a favorable attitude toward Labor in 
both tests. 

In Capital, however, there was a very 
close division of scores, especially in 
Tests A and B. In Test C the mean is in fa- 
vor of Capital but by a relatively small 
margin. Because of this close division, it 
is not surprising that one group may be 
found with a mean score in favor of Capital 
and another with a mean score in the opposite 
direction, but both hovering around the zero 
point. 

The study indicates few significant dif- 
ferences in attitude among the three geo- 
graphical groups, except in Capital. Where 
these differences are large enough to appear 
at all significant, they will be discussed 
in this chapter. The complete data for this 
part of the study may be found in Tables I 
and II, 

The chief fact of importance in connec- 
tion with all tests is the uniformity of 
mean scores from group to group. The rural 
group, however, stands out in certain of the 
comparisons in such a way as to give some 
basis for asserting that, as a group, the 
rural pupils exhibit attitudes which separate 
them from the other groups. The difference 
which this group exhibits is toward a strong- 
er opposition to our economic system. The 
city group stands out somewhat in its sup- 
port of our economic system, being the only 


one to obtain a mean score favorable to Cap- 
ital, 
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Attention should be called to the fact 
that the mean scores in Test C are all fa- 
vorable to Capital. It seems probable that 
this is due to the nature of the test which 
does not allow for the more careful choosing 
of positions on the various statements, and 
also because the method of scoring does not 
permit those finer numerical discriminations 
which are characteristic of Tests A and B. 
The rural group shows a more unfavorable at- 
titude toward Capital in all tests than 
either of the other groups. These differ- 
ences are great enough to be significant. 

The most important statement that may 
be made concerning the town group thus far 
is that it exhibits a tendency to stand mid 
way between the other two groups. This is 
particularly true in attitude toward Capital 
in Tests A and B. 

The position of the three groups in re- 
lation to Capital in the three tests shows a 
very interesting trend. In all tests the 
rural group is shown to possess the most une 
favorable attitude toward Capital while the 
city group is relatively strong in support 
of Capital, especially in Tests A and C. 
These relationships are presented in Figure 
1. It should be emphasized that the signif- 
icance of the relationships existing among 
these various groups which have just been 
discussed does not lie in the relative mag- 
nitude of the scores but in their trend. In 
only two cases were differences found which 
were as much as four times their probable 
error. The only claim made in connection 
with the above presentation is that certain 
relatively small differences show fairly 
definite trends, 

The mean scores on Labor do not show im 
portant trends, In Test A the only differ- 
ence of much importance is that between the 
city and rural groups. Here, the latter 
shows a more favorable attitude toward Labor 
than the former. This, however, is offset 
by the opposite tendency in Test B. Again, 
the city group shows a more favorable atti- 
tude toward Labor in Test B than the entire 
group, and is especially more favorable than 
the town group. This is offset by opposite 
but weaker tendencies in Test A. The most 
that may be said of the attitude of all 
groups toward Labor is that it is favorable, 
Since the differences are of relatively 
small magnitude, none being as much as four 
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MEAN SCORES, STANDARD DEVIATIONS, AND PROBABLE ERRORS OF NET SCORES FOR ALL GROUPS 
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Test B 



































ee 2 aS Test C a 
| Stand- |Probable | ]Stand- | Probable a |Probatle 
Group Mean | ard De-| Error | Mean jard De-| Error Mean (ard oe Error 
| viation|of Mean | viation | of Mean - | viation |of Mean 
Government 

City 5.52 6.76 224 | 4.37 7.82 37 
Town 5.49 7.98 -378 4.72 7.92 -38 
Rural 6.70 8.73 -410 4.18 7.92 37 
Combined 6.02 6.20 .224 | 4.41 7.89 22 

ae a . : Capital = 
City | 0.49 6.92 .328 | -0.50 5.74 227 2.49 7.50 .36 
Town -0.76 5.92 280 | -0.24 5.76 227 2.19 7.56 36 
Rural -1,66 6.45 -310 | -1.05 5.38 225 0.28 7.86 .37 
Combined -0.74 6.56 -179 +#+| -0.70 5.54 15 1.64 7.68 21 

a aa spas iy Labor 
City 4.50 6.30 .298 | 2.53 4.04 .191 
Town 5.16 7.20 .340 | 1.69 4.54 215 
Rural 5.37 6.10 -347 1.59 4.10 .192 
Combined 4.965 6.84 186 | 1.94 4.35 118 
‘ 
TABLE II 


OBTAINED DIFFERENCES IN MEAN SCORES, PROBABLE ERRORS OF DIFFERENCE, AND CHANCES OF TRUE 


DIFFERENCE BEING GREATER THAN 


ZERO BY GEOGRAPHICAL GROUPS 
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Test A Test B | Text C 
| Chances in a |_| Chances in | Chances in 
|Probable | 100 That Mean (|Probable| 100 That Mean | Protable 100 That 
|True Differ-  Differ-| Error (True Differ- | Differ-| Error True Differ- 
of Dif- j|ences Great- ence jof Dif- jences Great-| ence /of Dif- ences Great- 
| ference jer Than Zero | | ference jer Than Zero | ference er Than Zero 
Government 
47 76.0 -07 43 50.0 } 
45 78.5 31 44 68.0 
47 84.0 23 43 63.5 | 
56 50.0 38 53 68.0 
58 91.0 16 052 58.0 
56 92.5 254 053 75.0 
Capital 
~ 3? 98.7 | .20 .80 67.0 85 41 91.5 
235 50.0 46 -30 84.0 .55 41 82.0 
256 95.5 255 02 76.0 1.36 042 98.5 
+44 97.0 226 237 68.0 | 30 -51 66.0 
45 99.9 255 236 84.5 | 2.21 «52 99.9 
42 92.5 81 36 95.5 / 1.91 52 99.0 
=< “Labor mee 
35 81.0 59 =o 8S 9.0 | 
059 63.0 25 025 75.0 | 
40 78.0 250 025 83.0 
245 80.0 84 229 97.7 
246 90.0 94 o27 99.0 | 
.43 61.0 10 _—«w29 51.0 | 
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Figure 1. Trend of mean scores, 
comparing mean scores of entire group 
with each geographical division in 
Capital, Tests A, B, and C. 


— — — — Test B 
Test C 


times its probable error, and no uniform 
trends are shown, little significance can 
be attached to the existing differences. 

Summarizing the differences in attitude 
toward Government, it may be said that the 
chief difference is found in the mean of the 
rural groun in Test A. This group is more 
favorable toward the government than either 
of the others. In Capital, the two impor- 
tant items are the relatively strong tend- 
ency of the city group to support our eco- 
nomic order, and the pronounced drop in the 
curve indicating relatively strong opposi- 
tion to Capital on the part of the rural 
gsroup. The entire group gives relatively 
strong support to Labor, but the scores show 
no trends from group to group. 


| a remarkable rise at the eighth grade. 
| curve is especially high in Test A, rising 
| to a mean of 8.21 which is more than 

that of 


was due 


| the means of another group, 
| discussed later under the title of the 
| group, are presented for comparison. 


| approximately 4,000, 
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| 


There are certain grade differences and 
grade trends in attitude that deserve care- 
ful consideration. The first item to which 
attention should be called is the appearance 
of a regular grade trend in attitude toward 
Government both in Tests A and B. These 
data are to be found in Tables III and IV. 
The relationships are presented graphically 
in Figure 2. It will be noted from a 
sideration of the graph that 


con- 
the curve shows 
The 


twice 
the seventh erade, 

An investization was made to discover 
whether this hizh mean for the eizhth 
to an unusually high mean for 
particular group. Such was not the case, 
In all groups in Test A and in all but the 
city group in Test B, the eizhth grade stands 


rrade 
some 


| high above both the seventh and ninth grades. 


In order that this may be made clear, the 


mean for each of these three grades for Test 


A is presented below: 





Grades 
Group 8 
Teobk 
8.25 
9.60 


City 
Town 
Rural 





| This difference in the eizhth crade mean be- 
| comes more remarkable because of the 
| tively wide sceorraphical areas fromwhich these 
| scores were obtained. 
| groups were taken from points 
| 130 miles apart, while the rural scores were 
| obtained from six different schools scattered 
| over a fifty-mile area. 


rela- 


and town 
approximately 


The city 


As a further check against these scores, 
which will be 
Rugg 
The 
scores of this group come from two widely 
separated towns each with a population of 
The scores for this 
group are: seventh grade, 3.64; eighth 
grade, 6.18; and ninth grade, 0.72. It ap- 
pears, therefore, that this difference in 
the eizhth grade mean cannot be attributed 
to an error in sampling, 
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MEAN SCORES, STANDARD DEVIATIONS, AND PROBABLE ERROR OF MEAN BY GRADES 













































































 _ » (ae ae Test B Test C 
¥ ~ | Stand- | Probable Stand- |Probable Stand- | Probable 
Grade Mean ard De- Error Mean| ard De-| Error Mean ard De- | Error 
viation| of Mean viation|of Mean viation |of Mean 
Government 
. ' r - 
7 4.08 | 9.06 | .55 | 2.95) 6.40 | «51 | 
8 8.21 8.34 | .65 5.46| 8.04 | .55 
9 4.90 | 9.16 | 60 3.47/ 8.13 | .53 
10 6.48 7.32 | .48 4.11; 7.50 | .49 
ll 7.71 | 7.56 | 457 5.76| 6.42 | .48 
12 5.57 | 8.13 | .54 5.39| 7.98 | .54 
— a ss i i 
Capital 
ee ee te 
7 -1.46 6.58 | .40 -1.30| 5.94 36 0.78 | 6.78 242 
8 73 | 6.52 | 45 -0.84) 5.08 35 2.43 | 7.56 52 
3 -1.15 | 6.64 | .44 -1.80| 5.50 36 0.36 | 8.10 54 
10 -1.12 | 6.46 | .42 -0.04) 5.24 34 2.12| 7.50 49 
ll -0.63 | 6.28 | .47 0.13| 5.38 -40 3.35 | 8.46 -63 
12 -2.66 | 6.26 | .42 -0.66| 6.00 | -40 2.12| 8.37 057 
i 
Labor 
7 4.75 7.08 43 1.00; 4.36 -27 
8 4.22 7.74 253 1.95) 4.54 31 
9 3.53 6.45 42 1.99} 4.40 -29 
10 5.46 7.08 46 2.11} 4.10 27 
ll 4.57 8.28 -62 2.37| 4.40 33 
12 5.72 6.45 043 2.19} 4.28 -29 
Other differences worthy of note are in | ment. The difference is quite marked, espe- 


connection with the eleventh grade. This 
grade stands higher in its Government score 
than any other except the eighth. In Test B 
the Government score of this grade stands 
out with a mean somewhat higher than that of 
the eighth grade. The interrelation of dif- 
ferences in Government scores, Test B, shows 
fairly significant differences in most cases, 
with a trend toward higher scores from the 
seventh to twelfth grade, 

The mean scores in Capital show impor- 
tant differences, but fail to indicate the 
definite grade trends that were present in 
the Government scores, In Test A, the 
eighth grade maintained the same high place 
with relation to the other grades, as in 
Government, being the only grade that showed 
an appreciable plus score. The tendency of 
the twelfth grade to deviate from the other 
grades by showing a somewhat weaker support 
of Capital is similar to that found in Govern 





cially in Test A. These relationships are 
presented in graphic form in Figure 3. Test 
C in Capital shows relationships similar to 
those found in Test A. The eighth grade 
stands high but not as high as the eleventh, 
while the downward trend of the curve from 
the eleventh to twelfth grades, which has al- 
ready been noted in the two preceding tests, 
is present here. Wide differences are also 
shown between grades seven and eleven and 
between nine and eleven. 


Significance of Differences in Mean 
Scores 





The discovery and presentation of dif- 
ferences in the mean scores of the various 
groups under consideration is a comparative- 
ly simple matter, but a discussion of their 
significance is a difficult problem. An at- 
tempt is made in the following discussion to 
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TABLI IV 
OBTAINED DIFFERENCES IN MEAN SCORES, PROBABLE ERKOKS OF DIFFERENCE, AND CHANCES 
OF TRUE DIFFERENCE BEING GREATER THAN ZERO BY GRADES 
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L Test A | Test C 
| Chances ia | | Chances in | Chances in 
Probable} 100 That Probable| 100 That Probable 100 That 

| Mean Error True Differ-| Mean Error (True Differ-| Mean Error True Differ- 


ences Great- 
ference | er Than Zero 
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Figure 2. Comparison of grade 
trends of mean scores in Government, 
Tests A and B. 
———-— Test A 
~----- Test B 

point out the significance of the general 
trends of scores and to assign reasonable 
causes for some of the more important dif- 
ferences which appear. No explanation of 
any phenomenon can be complete without an 
adequate knowledge of the important facts 
with which it is associated. The obvious 
difficulty in attempting an explanation of 
the various differences exhibited in these 
tests lies in the fact that there may be im 
portant factors influencing the outcomes 
which the tests do not reveal. For example, 
one does not know to what extent the atti- 
tude of any particular grade has been inflw 
enced by the attitude and opinions of the 
classroom teacher. One does not know to what 
extent factors lying entirely outside the 
classroom may have influenced attitudes. 

Thurstone (14) has shown by recent ex- 
periments that the attitudes of high school 
pupils are measurably affected by moving 
pictures, Again, we cannot be certain of 
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Figure 3. Comparison of grade 

trends of mean scores in Capital, 
Tests A and B. ‘ 
———— = Test A 

—— —-— Test B 
the extent to which children's attitudes are 
colored by the heat of a political campaign 
being waged in the country. These and many 
other factors which cannot be discovered and 
evaluated in a single study may exert influ- 
ences upon children's attitudes toward po- 
litical and economic questions. 

The first item to be discussed is the 
fact that all groups showed a rather pro- 
nounced support of our government. This fact 
when taken by itself should cause no sur- 
prise. It would perhaps cause grave concern, 
even to those who are outspoken critics of 
our government, to find that the majority of 
school children were unwilling to support 
the legally constituted government. The sit- 
uation appears in a somewhat different light 
when attention is directed to the type of 
statements which the majority of these pu- 
pils accepted in registering this support. 
Each statement is supposed to tap a field of 
controversy in which relatively large numbers 
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of people disagree. The significance of the | to the same tune. In other words, the pat- 
sutcome lies in the fact that the mean scores! tern responses to questions of economic im- 


were very high in the light of the foregoing 
facts. It suggests that indoctrination has | 
reached a reletively high point in the 


port have not yet become household words; 
consequently, they are less colored by indoc- 
trinated beliefs. It may not be too fantas- 
teaching of American history and government, | tic even to suggest that for this reason the 
Quite in contrast to the scores in Gov- | responses to the statements in Capital may 
ernment are those recorded in Capital. Here | express a higher degree of thoughtfulness be- 
is found a pronounced tendency toward atti- | cause of greater intellectual freedom. 
tudes which are unfevorable to our economic | The political conservatism of the rural 
system. The question arises at once whether | group has been noted and may deserve a word 
this is not inconsistent with the attitude | of comment. There is a pronounced belief in 
expressed toward the government. It is ob= | some quarters that our farm people are be- 
vious to the well-informed adult that our coming radical. Occurrences of recent date 
economic system is inextricably intertwined | in certain farm areas in the middle west 
with our political system, and that the Whatever 


| give some ground for this belief. 
former depends for its existence upon the | may be claimed as to farmer attitude in gen- 
support of the latter. How then can we ex- 


eral, there is strong reason for believing 


plein the two opposite tendencies on the _ | that the scores resulting from this test re- 
part of these pupils? | flect the political attitude of the area in 
A partially satisfactory answer to which the test was given. The basis for 


this question may be found in the fact that 
the statements in the tests pertaining to 
Government bring into question the integrity 
of "My Country." The religion of national- 
ism has been well taught in our schools. Any 
statement which questions the motives of 
America and those who direct her destiny is 
likely to strike fire in the mind of the 
school child. His history books have been 
well censored. Evidence is not lacking to 
show the solicitude evidenced by various 
patriotic organizations and political groups 
in protecting our schools against historical 
heresy (8, 9). School children have learned 
their lesson well. Are elections in the 
United States won by money? "Certainly not” 
is the orthodox answer of more than 86 per 
cent of the group. Was the territory of the 
United States gained honestly? "Certainly" 
comes the answer of three fourths of the 
groupe 

We do not seem to have been as success- 
ful, however, in teaching the support of our 
economic system. Organized control of eco- 
nomic teaching in the public schools is of 
relatively recent origin and consequently _ tributors have been undergoing a severe 
has not yet reached that high state of or- | strain for the past year or two, 
ganization which is exhibited in the control | It has been noted that a regular grade 
of the teaching of history and government, | trend exists indicating a progressively 


this belief lies in the fact that the area 
in question is in a congressional district 

_which has always elected a Republican con- 
gressman. Even during the Democratic land- 
slide of 1932 this district was safely Re- 
publican. 

The scores in Capital among the geo- 
graphical groups in question have been dis- 
| cussed at length in the preceding pages. It 
| remains only to comment briefly on the fact 
| that the rural group indicates a much strong 
er opposition to our capitalistic system 
than any of the other groups. It is possible 
that this difference, particularly between 
the rural and city groups, may be accounted 
for by the stronger influence of capitalistic 
propaganda in the city than in the country. 
The capitalistic class is chiefly a city 
|class. It would be natural to expect that 
| its influence would be stronger in the city 
| than in the country. Another factor in the 
| particular areas studied is that these farm 
‘ers are situated just on the edge of the 
| Chicago dairy region in which the economic 
| relations between milk producers and dis- 


Neither to the teacher nor to the pupils 
does "My Capitalism" possess the same conno- 
tation as "My Country." They cannot be sung 


stronger support of the government from 
grade to grade. In the light of our previ- 
ous discussion this trend would be expected, 
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The persistent emphasis on nationalism in 
the schools, to which attention has been 
called, together with the usual community 
condemnation of political radicalism seems 
to bear its fruit from grade to grade. The 
peculiar drop in the curve from the eleventh 
to the twelfth grade, as noted in Figure 2, 
may be attributed in part to the fact that 
in this grade a more critical study is made 
of American history and covernment. This 
may be augmented somewhat by that usual ate 
tack of “senioritis" in consequence of which 
this group tends to conden everything. The 
downward trend of the curve in Capital, as 
noted in Figure 3, may be explained upon the 
same basis. It is here that the study of 
economics is introduced, and in one of the 
groups represented in this investigation so- 
ciology also is given. 

The rather disturbing position of the 

eighth grade groups seems to be partly ex- 
plained by the fact that introduction of the 
study of civics occurs at this time in most 
groups studied. While it is impossible to 
present objective evidence that this rather 
spectacular rise in the curve at the eighth 
grade has this as its cause, it may reason- 
ably be in*erred that such is the case, In 
the first place, the textbooks used in 
eighth grade civics were of the orthodox 
type. The elementary civics texts were 
Hughes! Elementary Community Civics, Dunn's 
Community Civics, and Finch's Everyday Civ- 
ics. It may be noted in this connection 
that one of the above texts has been de- 
clared "good" by the Illinois Committee on 
Public Utility Information (4). Further- 
more, it has been the writer's observation 
that very little is done by way of a criti- 
cal study of civics in the eighth grade. 
The influence both of the textbook and the 
teacher would tend to instill a somewhat 
blind faith in our government and its eco- 
nomic organization. 

Another factor which possibly has in- 
fluenced the eighth grade scores is that by 
the time pupils have advanced well into the 
eighth grade the nationalistic tone of his- 
tory texts may begin to make itself felt. 
Fven with a liberal textbook in the hands of 
pupils, there still is present the influence 
of the orthodox classroom teacher which has 
not been measured, 
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| Thus, this first introduction of these 
pupils to the study of government would per. 
haps have a strong tendency to produce a 
“rather pronounced result in the direction in. 
dicated. It cannot be known to what extent 
the results have been influenced by the fact 
that these eighth grade pupils have develop~ 
ed an attitude of responsibility by virtue 
of being the highest class iu the school, 
In all schools studied the plan of eight e1- 
ementary grades and a four-year high school 
course was in operation. It is customary 
for teachers to impress upon eighth grade 
pupils their responsibility for the ideals 
of the school. To what extent this might be 
carried over into political and economic at- 
titudes is mere conjecture, although it is 
perhaps worth mentioning as a possible con- 
tributing factor in the solution of the prob- 
len, 


QUALITATIVE ANALYSIS OF DATA 


Although it is important to know the 
mean scores of the various groups on the 
test as a whole, these scores do not reveal 
the whole truth. Hence, a qualitative an- 
alysis of certain factors is presented here, 


High and Low Scores 





High scores may be made on any of these 
tests only by assuming a uniformly extreme 
position either in a plus or minus direction, 
A few pupils obtained extremely high scores, 
In fact, one pupil received the highest pos- 
sible score to be had in Government on Test 
A which was 26.4, while four received the 
highest possible score on the test in Labor 
which was 24.9, This means that these five 
pupils had no minus scores. Three pupils 
received the highest possible score, 12.2, 
in Labor on Test B. One pupil in the entire 
group reached the highest possible minus 
score, which was -17.1 in Test B on Capital, 

The extremely low scores indicate a4 
somewhat different situation than that rep- 
resented by extremely high scores. A low 
score may be obtained in one of two ways. 
The pupil may mark a number of extreme plus 
statements and then offset his high plus 
score by marking a number of extreme minus 
statements, or he may assume a relatively 
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neutral position on all statements. In eith 
er case his net score, i.e., the algebraic 


sum of his gross scores, will be low. It 
seems improbable, however, that these two 
types of low net scores would indicate the 


same attitude, although the numerical val- 
ues would be the same or nearly so, 

This may be illustrated by two exam 
ples. Pupil 38, an eighth grade rural girl, 
received a net score of -1.1 in one test, 
and pupil 74, a ninth grade rural girl, re- 
ceived a net score of 0.2. These scores 
would seem to indicate that the attitudes 
of these two pupils were very similar. But 
a further investigation reveals that the net 
score of pupil 38 was composed of gross 
scores of 11.8 and -12.9, while the net 
score of pupil 74 was composed of gross 
scores of 4.6 and -4,4, 

This situation suggested a study of 
the composition of a number of low scores to 
ascertain whether they were the result of 
high plus and minus scores balancing each 
other, or whether they were the result of a 
uniformly neutral position throughout the 
test. A low score was arbitrarily consider- 
ed to be any net score not greater than +2.5, 
while a high score was considered to be any 
net score of not less than +10.0. In the 
Capital test the basis of selection was +2.0 
and +7.0, due to the fact that the range of 
score on this test is narrower than in the 
Government or Labor sections. 

All low scores on Government and Labor 
in Test A and on Capital in Test Bwere in- 
vestigated, there being a total of 373 such 
scores. Of this number it was found that 
only 24 scores, or 6.4 per cent of the total, 
were composed of the sum of large scores, 
Thus it seems fairly certain that low scores 
indicate a tendency toward a neutral posi- 
tion maintained rather consistently through- 
out the test, rather than toward high plus 
and minus scores balancing each other. 


Extreme Scores 





Two investigations were made of extreme 
scores, one to determine whether extreme 
scores were characteristic of any particular 
geographical group, and the other to discov- 
er the tendency of the group to assume ex- 
treme positions on the various statements, 
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Test A was used for the former investigations 
and Test B for the latter. A net score of 
+ 20.0 or more was considered as an extreme- 
| ly high score on Government and Labor and 
| + 12 or more on Capital. Eighty-six such 
scores were found in Test A. There was 
tendency for any group to predominate in 
these scores. The distribution was city, 28; 
town, 26; and rural, 32, 

The investigation of extreme scores 
Test B brought to light some interesting 
facts. It will be recalled that this is a 
degree of truth test in which the pupil is 
asked to underline numbers from 1 to 5 in- 
Clusive, indicating the extent to which he 
believes the statement to be true or false. 
| Underlining number 1 means that he believes 
the statement to be certainly or completely 
true, while the underlining of number 5 
means that he believes the statement to be 
certainly or completely false. 

Of the entire twenty-one items in this 
test, 13.9 per cent of the markings of the 
612 pupils were on 1, while 24.4 per cent 
were on 5. These data are found in Table V. 


no 





in 


TABLE V 
STATEMENTS IN TEST B MARKED "CERTAINLY 
TRUE" AND "CERTAINLY FALSE" BY 
GEOGRAPHICAL DIVISIONS 
































State- City Town Rural Totals 
ment |True|False|True|False/True| False! True| False 
1 12| 50/15| 46| 8| 47 | 35 | 143 
2 17 | 74/26 | 53 | 26| 43 | 69| 170 
3 53 | 23 | 44 | 30 | 49| 25 146 | 78 
4 15 | 68/10, 60) 9| 674) 34| 215 
5 48 | 44 | 50 | 57 | 50) 47 | 148 | las 
6 18 58 27! 71 | 30| 58 | 75 | 187 
7 36 | 23 | 24 | 28 | 22| 23 | 62| 74 
8 32 | 103 | 48 | 67 | 47/| 65 | 127 | 235 
9 23) 72/13 | 87 | 16) 8 | SR | 242 
10 39 | 44 37) 35 | 46) 43 (lee | lee 
11 6| 66/12] 41] 11/ 27 | 29] 134 
12 16 | 56/18| 31 / 20| 37 | 54| 124 
13 9| 109 | 5 | 107| 10; 10 | 24| 326 
14 33 | 39) 47| 51 | 38, 49 | 118/ 129 
15 27 75 | 24|) 75 | 27) 88 | 78| 238 
16 26 | 29 | 3 | 36 | 24} 55 | 86| 120 
17 25| 35 | 34| 36/ 33) se | 92/ 105 
18 9| 49/211| 44/ 16; 56 | 36| 152 
19 40 | 37 | 83 | le | 64) 15 | 187| 68 
20 47; 11| 55; 17/ 46; 17 |148/ 45 
21 a2 | 31 | 39) 36 | 38) 30 | 1l9| OF 
Total |573 |1096 | 656 | 1046 | 632 1019 [166¢ | 3161 
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A few examples are given below showing 
statements upon which the marking was very 
pronounced upon one side or the other, and 
some others where the division of opinion 
was fairly equal, 


1. The laws are more favorable to the 
rich than to the poor. (Certainly 
true, 35; certainly false, 143) 


9. The government of the United States 
is controlled by the rich. (Cer- 
tainly true, 52; certainly false, 
242) 


11. Labor unions are usually disloyal 
to the government. (Certainly true, 
29; certainly false, 134) 


15. A poor man has little opportunity 
to succeed in this country. (Cer- 
tainly true, 78; certainly false, 
238) 


20. The Monroe Doctrine is fair to all 
countries. (Certainly true, 148; 
certainly false, 45) 


Statements upon which extreme scores 
were fairly evenly divided were: 


5. Under the present economic arrange- 
ment the rich are getting richer and 
the poor are getting poorer. (Cer- 
tainly true, 148; certainly false, 
148) 


10. Rich people who are accused of crime 
are seldom convicted by the courts. 
(Certainly true, 122; certainly false, 
122) 


14. The Eighteenth Amendment is one of 
the chief causes of crime. (Cer- 
tainly true, 118; certainly false, 
139) 


The rural group accepted fewer statements 
as certainly true than either of the other 
groups, the percentages being city 14.4, 
town 15.3, and rural 11.9. The rural group 
was also slightly more cautious in rejecting 
statements by marking them certainly false, 
although the difference is too small to be 
of much significance. The percentages were 
city 25.2, town 24.5, and rural 23.4, 

An interesting picture is presented by 
separating these extreme scores by school 
grades. If the assumption is made that 
there are relatively few statements in the 
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field of political and economic controversy, 
which may be pronounced as certainly or com 
pletely true or certainly or completely 
false it might be justifiable to entertain 
the hope that as pupils advance from grade 
to grade they would become progressively 
less willing to express such high degrees of 
certainty. Yet the study fails to justify 
this hope, as is shown by the percentage of 
statements distributed by grades: 


Grade Percentage 
7 24.7 
8 24.9 
9 23.2 
10 23.1 
11 26.9 


12 23.8 q 


A somewhat better showing was made in the 
acceptance of statements as certainly true: 


Grade Percentage 
7 17.7 
8 15.0 
9 17.6 

10 12.3 

11 12.1 

12 11.7 


Perhaps this last tabulation deserves 
some comment. An interesting junior-senior 
high school grouping appears, with the senr- 
ior group decidedly lower. The average of 
the percentages for the two groups is 16.7 
and 12.3 respectively (Figure 4). 


Group Attitude Toward Various 
Single Statements in Test A 








The discussion of mean scores indicated 
the attitude of the various groups toward 
the three fields under investigation. How- 
ever, it does not show the attitude of these 
groups toward the separate statements which 
make up the test. 

The following section gives: (1) the re- 
sponse of the entire group toward each 
statement in Test A, (2) the response of 
each geographical group toward certain state- 
ments where significant difference might be 
expected to exist, and (3) the reaction of 
various grades to different statements in 
the test. 
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Figure 4. Percentages o: pu- 


pils marking statements in Test B 
as "Certainly true" and "Certainly 
false." Distribution by grades. 


Certainly false 
————Certainly true 


The following statements definitely ac- 
cepted! were taken from Test A: 


10. Wealthy people are almost exclusive- 
ly engaged in making money. 

17. Laborers should be paid high enough 
wages so that they nay live in com- 
fortable homes. 

21. The territory of the United States 
was gained honestly. 

R22. 


American citizens should be given 


employment in preference to foreign- 
ers. 


Statements taken from Test A which were 
definitely rejected are: 


2. Criminals in our large cities are 


shielded by officials. 


4. Labor unions are disloyal to the 
government. 


| 
' 
| 
| 


18. 


The following are statements in 
to whic 


given: 


i. 


ll. 


1é. 


- The laws are more favorable to 


D 
—~ 


- Members of labor unions are danger- 


ous persons. 


Persons who believe 
are Gishonest. 


in prohibition 


Persons who talk in favor of Russian 
communism are patriotic citizens. 


the 
rich than to the poor. 


Elections in the United States 
won by the candidate having the 
most money. 


are 


Labor agitators are anarchists. 


Wealthy people should be required to 
pay a higher rate of taxes than oth- 
2r people. 


Millionaires came by 
honestly. 


their vealth 


Millionaires have become wealthy by 
intelligence and hard work. 


Laborers should be given shorter 
hours without any reduction in 
pay. 


Test A 


h practically neutral responses were 


Elections are controlled by men of 
wealth. 


Bankers are interested in making 
money regardless of the welfare 
their depositors. 


of 


Prices are high because our indus- 
tries are controlled by corporations. 


Decisions of the Supreme Court are 
partial to those who have wealth. 


Strikes by labor unions ought to be 
prohibited by law. 


Public utilities such as electric 
power plants and telephone systems 
should be owned by the government. 





score in opposition to it. 
74.5 to 84.9 per cent. 


- Acceptence or rejection is based on the sum of the total gross score in favor of the statement plus the total gross 
The percentage of gross scores by which the following statements were accepted was from 
The percentage by which others were rejected was from 61 to 86.8 per cent. 


In the list in 


which the response was practically neutral the percentage was approximately 50 per cent for and 50 per cent against. 
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19. Elections are carried on honestly. 


20. Good citizens are in favor of labor 
unions. 


Complete data for these three types of state- 
ments are given in Table VI. 

Some interesting differences were no- 
ticed in the response of the different ge- 
ographical groups to certain statements. 
Where a difference in attitude might be ex- 
pected, the scores of each group were segre- 
gated and compared with one another. These 
differences may best be expressed in graph- 
ic form. Figure 5 shows the distribution of 
the responses to the statement, "Persons 
who talk in favor of Russian communism are 
patriotic citizens." This statement was re- 
jected by all groups, but it is significant 
to note the increasing condemation of the 
statement as the curve proceeds from the city 
to the country group. 

Figure 6 indicates the distribution of 
the response to the statement, "Good citi- 
zens are in favor of labor unions." The 
curve shows a progressive difference again, 
but in the opposite direction. The majority 
opinion is favorable to the statement in the 
city, but unfavorable in both town and coun- 
try. Mean net scores on each statement by 
grades for Test A are presented in Table VII. 
Figure 7 presents the attitude toward the 
statement, "Laborers should be given shorter 
hours without any reduction in pay." 


Trend of Attitude Shown by Grade 
Distribution 








Some interesting facts may be brought 
to light by a study of the grade distribution 
of the attitude toward the various state- 
ments in Test A. When a separation of the 
entire group is made into junior and senior 
high school divisions, there is evidence 
that the senior high school group tends more 
toward a position of conservatism toward 
Government than the junior high school group. 
In all but two statements in Test A relating 
to Government, the senior high school group 
shows higher mean scores than the junior 
high school. Although the latter group shows 


a favorable attitude toward the government in 
all but one of the statements, its mean score 
These data are found in Table VIII. 


is lower, 


AND AGAINST EACH STATEMENT 
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TABLE VI 
TOTAL SCORE OF ENTIRE GROUP IN TEST A, 
SHOWING PER CENT OF SCORES IN FAVOR 
























































Total Score Per Cent 
State- [ 1, fn 
ment Favor Against Favor Against 
1 474.8 554.4 46.2 53.8 
2 296.4 1114.4 21.7 78.3 
3 211.6 1110.9 16.0 84.0 
4 181.3 753.0 19.4 80.6 
5 194.6 1151.0 14.5 85.5 
6 552.7 653.3 45.9 54.1 
7 536.3 418.9 56.1 42.9 
8 307.5 1166.9 20.9 79.1 
9 427.1 411.9 50.9 49.1 
10 616.0 180.3 717.3 22.7 
11 637.3 588.5 51.9 48.1 
12 306.6 1184.4 20.6 79.4 
13 159.4 1054.7 13.2 86.8 
14 354.2 740.4 32.4 67.6 
15 251.1 529.8 32.2 67.8 
16 382.0 623.7 38.0 62.9 
17 1591.0 281.9 84.9 15.1 
18 261.9 947.4 21.7 78.3 
19 940.2 578.3 61.9 38.1 
20 623.5 608.9 50.5 49.5 
21 1032.9 552.2 74.5 25.5 
22 941.2 230.1 80.3 19.7 
23 173.6 851.9 17.0 83.0 
24 571.9 923.7 39.0 61.0 
TABLE VII 
MEAN NET SCORES ON EACH STATEMENT IN TEST A BY 
GRADES 
State Grades 
ment 7 8 9 10 11 12 
1 |-0.235| 0.086; 0.142; 0.036 | 0.466/ 0.0 
2 1.094; 1.799} 0.979] 1.041 1.817} 1.282 
3 | 1.064| 1.542) 1.262) 1.365; 1.844] 1.500 
4 | 1.240) 1.085/| 0.925] 0.681 | 0.638) 0.933 
5 1.200} 1.592! 1.763) 1.574; 2.385; 1.222 
6 |-0.267| 0.329/ 0.139/ 0.075 | 0.469; 0.302 
7 |-0.803 |-0.033 |-0.235 |-0.051 | -0.204/-0.211 
8 | 1.101/ 1.239; 1.572] 1.257 1.646; 1.605 
9 |-0.575| 0.130|-0.260 |-0.120 |} 0.437)] 0.231 
10 |-0.881 |-0.536 |-0.591 |-0.671 | -0.817 |-0.760 
11 | 0.072/-0.160! 0.016 |-0.099 | -0.179)| 0.0 
12 1.226} 1.949) 1.133| 1.522; 1.595| 1.283 
13 | 1.409] 1.980! 1.733| 1.050; 1.672) 1.485 
14 | 0.841] 1.277) 0.533] 0.843; 0.114) 0.354 
15 | 0.756/ 0.927| 0.210/ 0.281; 0.462) 0.069 
16 | 0.461) 0.685) 0.195; 0.320; 0.350/ 0.548 
17 1.794) 1.663) 2.260] 2.413; 2.913) 1.420 
18 |-0.913)|-1.044|-1.405 |-1.206 | -1.228/-1.938 
19 | 0.601; 1.107) 0.685| 0.706; 0.144) 0.465 
20 | 0.341) 0.135/-0.026 |-0.080 | -0.239/-0.934 
21 0.790) 0.815) 0.636} 1.505| 1.750} 1.362 
22 | 0.870! 0.360; 1.152) 1.462; 1.591) 1.617 
23 |-1.165/|-0.907/|-1.289|-1.036 | -1.077|-1.143 
24 |-0.786/-0.975| -0.963/-0.223/| -0.137|-0.158 
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TABLE VIII Although the differences expressed in 
TREND OF ATTITUDE SHOWN BY these graphical representations and those 
JUNIOR-SENIOR AIGH SCHOOL ‘immediately following are not large, in most 
seeinteemenes cases their significance may be emphasized 
——_—~ — 'by drawing out the scale upon which the 
‘ Junior Senior scores are based. Figure 8 illustrates this 
nee | eye ghey |point. The scores are taken from Figure 16, 
—— | It may be seen from the above representation 
CEES | 0.010 | 0-142 that although the differences in mean scores 
Government 1.261 1.343 
Labor 1.209 1.547 |of these groups are small, yet the senior 
| Labor 1.090 0.757 high school group is distributed along the 
Capital 1.500 1.668 /scalie between "Most" and "All", while the ju- 


Capital 0.046 0.266 ‘nior high school group lies between "Many" 
| Capital -0.388 -0.151 ‘and "Most." 
| Capital 1.296 1.456 — 
|Government | ~0.262 0.160 
| Capital -0.678 ~0.742 Attitude in Relation to Content Material 
| Labor -0.015 -0.08% Contained in Social Science Textbooks 
sovermnent 1.412 1.459 
ao po pe It is interesting to know something of 
io | «So -% 
15 | Capital | 0.620 0.258 _the extent to which attitude may be influenc- 
16 | Capital 0.445 0.409 ed by the type of textbook material used by 
| | 
17 | Labor 1.906 2.196 pupils. It seemed worth while to make a 
18 | Capital -1.111 -1.150 
19 Government | 0.834 0.316 
20 | Labor 0.160 -0.085 
21 | Government 0.748 1.521 
22 | Labor 0.815 | 1.550 
23 | Capital -1.128 -1.085 


24 | Labor | -0.900 | -0.178 
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As has already been pointed out, all 
mean scores in Capital hovered around zero, 

In this comparison the senior high school 
group was somewhat less favorable to Capital | 
than the junior high school, the means being | 
-0.84 and -0.59 respectively. 

The mean of the senior high school group | 
shows a decidedly more favorable attitude 
toward Labor. The means were 7.99 for the 
senior high school and 4.25 for the junior 
high school. 

Grade differences in the several sep- | 
arate statements were not found to be par- | 
ticularly significant and were somewhat ir- 
reguler. Some of these more significant 
differences are shown graphically in Figures 
9 to 18 inclusive, These graphs show the 
typical trend from the junior to senior high 
school group. In some cases, notably Fig- | 
ures 10 to 15 inclusive, a regular grade Figure 5. Trend of mean scores 
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trend is shom. These graphs also indicate in city, town, and rural, on State- 


. ment No. 8, Test A "Persons who talk 
ae NESEY OC US Grew Se cepeemte inte in favor of Russian Communism are pe- 


7-8-9 and 10-11-12 grade divisions. This is triotic citizens." A plus score tends 
particularly noticeable in Figures 9, 12, to reject the statement. 
16, and 18, 
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Figure 6. ‘rena Ol mean scores 
in city, town, and rural on Statement 
No. 20, Test A, "Good citizens are in 
favor of labor unions." A plus score 
tends to accept and a minus score to 
reject the statement. 

~-Test A 


brief investigation into this field, 

A comparison was made of attitude scores 
between groups comprising the present study 
and groups that had used the Rugg social sci- 
ence textbooks, These texts have been used 
as the basis for the comparison on the as- 
sumption that their content, as well as the 
method of presentation employed in the texts, 
is distinctly liberal as compared with most 
other texts in the same field. 

The history texts used in the various 
schools of the regular groups are Tryon and 
Lingley, The American People and Nation; Hal- 
leck, The Story of Our Country; and Leonard 
and Jacobs, The Nation's History. In civics, 
the texts used were Hughes, Elementary Com 
munity Civics; Broome and Adams, Conduct 
Through Citizenship; and Finch, Everyday Civ- 
ics. 

Tests A and B were administered in the 
seventh and eighth grades in a town of 4000 
population and in the ninth grade of another 
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Figure 7. Trend of mean scores 
in city, town, and rural on Statement 
No. 24, Test A, "Laborers should be 
given shorter hours without any reduc- 
tion in pay." A minus score tends to 
reject the statement. 
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Figure 8. Enlargement of scale 
showing the distribution of mean 
scores on the scale by grades. Mean 
scores taken from Statement 21, Test 
A, "The territory of the United States 
was gained honestly." 
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Figure 9. Trend of mean scores by 
grades on Statement No. 4, Test A, "La- 
bor Unions are disloyal to the govern- 
ment." A plus score tends to reject the 
statement. 

Test A 
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Figure 11. Trend of mean scores by 
by grades on Statement No. 6, Test A, 
"Bankers are interested in making money 
regardless of the welfare of their de- 
positors." A plus score tends to reject 
and a minus score to accept the state- 
ment. 

Test A 














Figure 10. Trend of mean scores by 
grades on Statement No. 4, Test A, "Mem- 
bers of labor unions sare dangerous per- 
sons." A plus score tends to reject the 
statement. 
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Figure 12. Trend of mean scores by 
grades on Statement No. @, Test A, "Per- 
sons who talk in favor of Russian Commu- 
nism are patriotic citizens." A plus 
score tends to reject the statement. 
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Figure 145. Trend of mean scores by 
grades on Statement No. 2, Test A, "De- 
cisions of the Supreme Court are partial 
to those who have wealth." A plus score 
tends to reject and a minus score to ac- 
cept the statement. 
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Figure 14. 


Trend of mean scores by 


grades on Statement No. 15, Test A, 
"Wealthy people should be required to pay 


a higher rate of 


A plus score tends to reject the 


ment. 


taxes than other people." 
state- 
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Figure 15. 





Trend of mean scores by 


grades on Statement No. 20, Test A, "Good 


citizens are in favor of lebor 


unions." 


A plus score tends to accept and a minus 
score to reject the statement. 






























































Figure 16. 


Trend of mean scores by 


grades on the statement, "Territory com- 
prising the United States was gained hon- 
estly." A plus score tends to accept the 


statement. 
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Figure 17. Trend of mean scores by 
grades on the statement, "Americans should 
be given employment in preference to for- 
eigners." A plus score tends to accept 
the statement. 

-—-—-—-Test A. 
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Figure 18. Trend of mean scores by 
grades on the statement, "Laborers should 
be given shorter hours without any reduc- 
tion in pay." A minus score tends to re- 
ject the statement. 
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| town of approximately the same size. There 

| were 176 pupils in this group. The mean 

| scores obtained from these pupils were then 
compared with those of the same grades in 
the regular group of 612 students. These 

| comparisons revealed some rather significant 
differences in attitude. The data are preé- 
sented in Tables IX and X. 


TABLE IX 
ATTITUDES OF RUGG AND NON-RUGG GROUPS 





Stand- | Probable 
ard De- | Error of 
viation | Mean 


—_ 4. — —_ 


Mean 


— ee 


| —_ 


Test A 


Government 


Rugg 
Non-Rugg 
Capital 
Rugg -1.24 
Non-Rugg -0.70 
Labor 
Rugg 6.93 
Non-Rugg 4.52 


Test B 
Government 
Rugg 1.26 
Non-Rugg 3.87 
Capital 
Rugg -0.83 27 
Non-Rugg -0.78 21 


Labor 
Rugg 3.75 19 
Non-Rugg 4.28 16 





it will be noted that the Rugg group is dis- 
tinctly less willing to support our present 
order of government than the non-Rugg group. 
The difference in Labor scores is also quite 
pronounced. In attitude toward Capital, how- 
ever, the two groups appear quite similar. 

| The relationship between the two groups is 
presented in Firures 1° and 20, 

The number of cases is large enough to 
indicate a significant difference. No fac- 
tors other than textbook usage were revealed 
which might account for these differences, 
Certainly, neither of the communities from 
which the Rugg scores were obtained could in 
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Figure 19. Comparison of mean 
scores of Rugg and non-Rugg groups, by 


grades, in Government, Test A 
— Rugg 
- Non-Rug 


TABLE X 
COMPARISON OF RUGG WITH NON-RUGG 
DIFFERENCES, PROBABLE ERRORS OF 
AND CHANCES IN 100 THAT TRUE 
ENCES ARE GREATER THAN 


g 


GROUPS SHOWING 
DIFFERENCES, 
DIFFER- 

ZERO 








Chances in 


Probable 100 That 
Error True Differ- 
Group biffer- of Dif- | ences Great- 
_ ee | ference | er Than Zero 
eet Test A 
Government | 1.13 | 4.52 | — 
Capital 254 037 84.0 
Labor 2.41 | 41 99.2 
Test B 
Government | 2.61 | —_— tt 99.9 
Capital C5 233 50.0 
Labor = 1.18 L PS a3 99. 
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Figure 20. Comparison of mean 
scores of Rugg, non-Rugg groups by 
grades in Government, Test B. 
—- Rugg 
Non-Rugg 
/any sense be classed as radical. On the corm 
| trary, the community from which the seventh 


and eighth grade scores were obtained is re- 
puted to be a highly conservative community. 
The fact that the mean scores of the Rugg 

group on both Government tests indicated a 


|/more critical attitude toward the government 
| than was shown by the other group is signif- 


| icant, 


in view of the reputation for liberal- 
ism in the Rugg textbooks as contrasted with 
the rather extreme nationalistic trend of 

the more commonly used social science text- 


| books, 


The greater similarity of scores on Cap- 
ital for the two groups under consideration 
tends to give weicht to the explanation of 


| the negative scores for the reguler group 


/previously set forth. 


This explanation was 


| that the school propaganda for economic con- 


servatism has not been carried on as effec- 


= | tively as it has for political nationalism. 
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correlation with IQ and Socio-Economic ship. The techniques employed in this study 
Status | offer a further means for measuring the ef- 
fectiveness of teaching. However, they do 
For purposes of this part of the inves- | not furnish a basis for determining what is 
-igation, [Q's basedon the Otis Self-Administer=- | desirable in civic attitudes. They do not 
- Test of Mental Ability were obtained for | apply to the evils themselves, but they can 
pupils in the city group. Socio-econom- | be used to evaluate materials, methods, cur- 
> scores were secured from the entire city | ricula, and teachers. 
‘roup of 202 pupils by means of the Sims 
sore Card. Correlations were computed to SUMMARY AND CONCLUSIONS 
ietermine the relation between intelligence | 
nd the attitude of favorableness toward The purpose of this investigation was to 
vernment, Capital, and Labor, | discover and measure the attitudes toward 
In computing the correlations, both the | current political and economic problems of 
minus and plus scores were used in the se- | pupils in frades seven to twelve inclusive; 
ries on the assumption that all scores lying | to discover the extent to which attitudes are 
helow the highest plus score have an ingre- influenced by certain factors such as place 
lient of unfavorableness. So long ac favor- | of abode (whether in city, tow, or country), 
bleness outweighs unfavorableness, as meas- | socio-economic status, school gerade, and gen- 
ired by the test, the score will be plus. | eral intelligence; and to suggest some con- 
Yhen the two appear in practically equal tributions to civic trainins arising from the 
urounts, the score will be zero, Similerly, | study. 
wnen favorableness is outweighed by unfavor- An attitude test was prepared by the 
ableness, the score will be minus; only in | writer and administered to 612 pupils in six 
he most extreme minus scores is the ingre- | different schools, A subsidiary study was 
ient of favorableness practically absent. | made of 176 pupils located in two different 
There was no correlation between in=- | towns to determine the effect of textbook 
‘elligence and any one of the three fields. | material upon pupil attitudes. 
) correlation was found between the atti- | The following are some of the more im- 
tude of favorableness toward either the Gov- | portant results of the study: 
ernment or Capital and socio-economic sta- 1. The mean scores of all groups indicated 
tus, but a negative correlation of .39 with a distinctly favorable attitude toward the 
aP.e. of .O7 between the attitude of favor- | present order of government. 
ableness toward Labor and socio-economic 2. The rural group stood measurably higher 
status was obtained. It might be expected in its support of Government than either tom 
at a correlation of this kind would exist, | or city. 
since those of low socio-economic status in | 3. The tendency was toward a progressively 
a city will almost invariably belong to the | stronger support of Government from grades 
vace-earning class, seven to eleven, but a decline in the mean 
| score of the twelfth grade. 
contributions of Study to Civic Training 4. The attitude of the group as a whole 
; was slightly unfavorable to Capital in two 
Public school education has as one of tests, but favorable in a third test by a 
its goals the inculcation of desirable civic | reletively small margin. 
ttitudes, Educators have heretofore been 5. Opposition to Capital grew progressive- 
atisfied to determine the effectiveness of ly from city to town and to country. 
aching and the progress of pupils chiefly, | 6. No distinct grade trends in attitude 
i? not entirely, on the basis of measured | toward Capital were discovered, although the 
sults in terms of the content material of | eighth crade showed a tendency toward its 
the curriculum, This is not surprising in | support while the twelfth grade indicated a 
' of the fact that until recently no sat- | rather pronounced opposition as compared 
-tory measuring instruments have been | with the two preceding grades. 


‘leble for determining growth in citizen- | 7. The entire group indicated a pronounced 
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support of Labor, with: lefinite trends 
from sroup to group, 

8. A regular, althourh not pronounced, 


srade trend indicated a prorress!vely stronc- | 


i 
er support of Labor from the sev to 
twelfth grades. 

j fo" + tay . a] 
trination ¢ beliefs in United state his 
TV) 3 = [a + : | 
tory and covernment. This view is supported | 
by: (1) the nature of the ¢etat ts ft 
which it w necessary for a pil to sub- 
scribe and t which a ma; { crit 


in order to obtain a high plus re in Gov- | 
ernment, and (2) the fact that the trend 

from grade to grade was toward a prorre 

ly stronger support of Government. 


10, There was little evidence that the 
pupils studied bec any 
from grade to oe fwelftt ra 


were only slightly less likely to subscribe 
to extreme statements than those of the sev- | 
enth grade. 

11. There seemed to be a tendency toward 
attitude vroupine on the basis of a junior 
high school group (grades 7-6-9) and a se- 
nior hieh school croup (grades 10-11-12). 
This was especially noticeable in the test 
on Government. 

12. There is evidence, althouch not con- 
Clusive proof, that textbooks whic} 
political and economic problems from a lib- 
eral point of view will influence student 
attitudes accordingly. 

13. The techniques exemplified in this 
study should make possible a more careful 
scrutiny of the end products of social scierce 
teaching than has heretofore beer possible, 
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INSTRUCTIONS TO JUDGES 


Instructions for Sorting Statements of 
Opinion on Certain Economic and 
Political Questions 











Tests A and B 


You are being handed herewith a packet 
containing statements exvressing certain at- 
titudes on the following subjects: 

(G) The established order of government 
in the United States 
Capitalism as embodied in our Ameri- 
can economic practice 
Labor (including both organized 
unorganized labor) 


(C) 


(L) and 


| 
| 


| vorable or unfavorable attitudes, you 


Distribute the remaining slips upon the 
scale so as to express the varying degrees 
of favorableness or unfavorableness 
the present order of government. 

You may not need to use all of the 
eleven steps on the scale in distributing 
the statements but use them all if you think 
{t necessary. If you do not find any state- 
ments that indicate the most extremely fa- 


will 


toward 


| not need to use steps eleven or one, but or 
ly nine or ten and two or three as the out- 


| each pile represents. 


You are asked first, to sort these state-| 


ents into four piles, pile No. 1 to contain 


lusively to government, No. 2 those which 
refer at least principally to capitalism (our 
economic order), No. 3 those which refer to 
labor. There may be a few statements which 
in your judement do not relate to any of the 
subjects named above. If so, place such 
statements in a separate nile, place a band 
around them and label this pile X. 

Now begin witn your pile of statements 
relating to covernment and distribute them as 
follows: Visualize a scale before you con- 
tainine eleven steps thus: 


Scale 
18234 89 7 


Most un-=- 


favorable 6 91011 


vorable 


Place on step 1 of the scale all state- 
ments expressing an attitude that is com 
pletely unfavorable toward the established 
order of government in the United States and 
on step 11 all statements expressing the most 
extremely favorable attitude toward the es- 
tablished form of government. Place on step 
6 those statements which in your judgment 
show an attitude which is neutral or practi- 
cally so. 


| 
| 


| 
| 
| 
| 


| 
| 
| 


| 
| 


| possible I 
| evaluated by a 


| lowine fields: 


m | process is very important. 
those statements which refer primarily or ex-| 


side steps. 

Now number the top slip of each pile so 
as to indicate the step on the scale which 
Your first pile then 
would be No, 1 and so on through the scale, 
the last pile being No. 11. (This numbering 
Be sure to make 
no mistake at this point.) 

Now place one pile on top of another 
and place a band around the entire pile and 


| label it G for government. 


Follow exactly the same procedure with 
each of the other piles containing state- 
ments referring to capitalism and labor, 


Mark the pile of slips representing capital- 


ism C and the one representing labor L. 

Now place a band around the entire lot 
including the X pile and return them 
your instructor. 


to 


Test C 

I am making a survey of opinion on 
economic and political problems among ju- 
nior and senior high school pupils. In or- 
der that the test scores may be as valid as 
am having responses to the 
group of competent judges of 
which you have been chosen as one. 

The test covers attitude in the 


test 


fol- 


The established order of government 
in the United States 








Ss RTS ZI. PY 





Capitalism as embodied in our rican 
economic practice 
Labor (including both ormanized aud un 
organized labor) 


Instructions for Marking 

1. Read the pupil's instructi 
page 2, notine that a pupil marks t state- 
ment (+) if he accepts and (-) if he reject: 
the statement. 

2. Beginning on page 3 and continuing 
to the end of the test are a number of par- 
agrapnhs under the title, Statement of Fact, 
each one designated by a Roman numeral, Be- 
neath each Statement of Fact are four re- 
sponses, some of which are marked (+) by the 
pupil indicatine that he accepts the 
ment; some are marked (-) indicatine that he 
rejects the statement, 

3. It is these responses numbered 1, 2, 
3, 4 under each Statement of Fact which you 
are to evaluate, taking into account in your 
‘valuation whether the pupil has accepted it 
(+) or rejected it (-). 

4. Read the Statement of Fact, 

5. Read each of the responses below 
the Statement of Fact, 

6, On the extreme left-hand margin, ot- 
side of the parentheses, indicate the field 
of attitude to which in your judpment each 
response relates, and whether it ts in fa- 
vor of or against a particular field of ate 
titude. Note: Be sure to understand that 
it is the pupil's marking (+) or (-) of 
these responses which you are to evaluate; 
i.@., what attitude, if anv, is indicated 
by a (+) marking and what attitude bya (-) 
merkins on the part of the pupil. 

7. Use the following markine system: 
> = capitalism, G = goverrment, L = labor 

up. Plus = in favor of, Minus = against, 


ro = no attitude. Thus, if you be- 
lieve a response indicates an attitude in 
‘avor of capitalism you will mark C_ on the 
margin opposite this response, if against 
rapitalism -C, etc. 

8, If the response indicates an atti- 
le in more than one field indicate this 
follows: L, -<G, for example, would mean 

that the statement indicates an attitude in 
favor of the labor group and against govern 





and Ze 


. If in your judgement a response ind 
ral attitude mark the iten 
‘*« 
10, Be sure to mark every response, 
i I INI i i ‘ 
t | j SAL I Ls tul 


Instructions for Administering 


1. Time allotted for test, 30 minut 
not includine time spent in filling in 
blanks on the front page of test. 

2. After each pupil has been provided 
with a copy of the test instruct pupils t 
fill in all blanks on the front cover of 


Lf an +} SVIAws wer eta - 
4. read the followine statement to 


In the folder before you are 
ee tests, Test A, Test B, and 
Test C. You are asked to read the 
tructions carefully for each 
st. An illustration is given, 
showing how to mark each test. 
Study these illustrations careful- 
ly, especially if you are not sure 
that you understand the instruc- 
tions, 

"Your answers will be looked 
upon as possessing real value, al- 
thoush the results of the test wil} 
in no way affect your school zrades, 
Mark each iten thoughtfully and 
carefully. 

"You should be sure to note 
that a test does not end at the bot- 
tom of the page. Watch the top of 
the next page for the remainder of 
the test. 

"Be sure not to omit any item, 

"You will be allowed 30 min- 
utes to complete your work on the 
three tests, 

"Now turn the page to Test A 
and begin. Continue through the 
three tests without stopping." 

4. In order that the data shall be 
alid as possible, teachers are asked that 
reasonable precautions be taken against pu- 
pils! copying their work from one another, 
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